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(i) APPLICANT: BOIME, Irving 

MOYLE, William R. 

(ii) TITLE OF THE INVENTION: SINGLE-CHAIN FORMS OF THE 

GLYCOPROTEIN HORMONE QUARTET 

(iii) NUMBER OF SEQUENCES: 83 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: MORRISON & FOERSTER 

(B) STREET: 2000 Pennsylvania Avenue, NW, suite 5500 

(C) CITY: Washington 

(D) STATE: DC 

(E) COUNTRY: USA 

(F) ZIP: 20006-1888 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/918,288 

(B) FILING DATE: 25 AUG-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/853,524 

(B) FILING DATE: 09-MAY-1997 

(A) APPLICATION NUMBER: 08/199,382 

(B) FILING DATE: 18-FEB-1994 



(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Murashige, Kate H 

(B) REGISTRATION NUMBER: 29,959 

(C) REFERENCE/DOCKET NUMBER: 29500-20050.25 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-887-1500 

(B) TELEFAX: 202-887-0763 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 




(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Ser Ser Ser Ser Lys Ala Pro Pro Pro Ser Leu Pro Ser Pro Ser Arg 

15 10 15 

Leu Pro Gly Pro Ser Asp Thr Pro lie Leu Pro Gin 
20 25 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 836 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 33. . .827 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

ATGAAATCGA CGGAATCAGA CTCGAGCCAA GG ATG GAG ATG TTC CAG GGG CTG 53 

Met Glu Met Phe Gin Gly Leu 
1 5 

CTG CTG TTG CTG CTG CTG AGC ATG GGC GGG ACA TGG GCA TCC AAG GAG 101 
Leu Leu Leu Leu Leu Leu Ser Met Gly Gly Thr Trp Ala Ser Lys Glu 
10 15 20 

CCG CTT CGG CCA CGG TGC CGC CCC ATC AAT GCC ACC CTG GCT GTG GAG 14 9 

Pro Leu Arg Pro Arg Cys Arg Pro lie Asn Ala Thr Leu Ala Val Glu 
25 30 35 

AAG GAG GGC TGC CCC GTG TGC ATC ACC GTC AAC ACC ACC ATC TGT GCC 197 
Lys Glu Gly Cys Pro Val Cys lie Thr Val Asn Thr Thr lie Cys Ala 
40 45 50 55 

GGC TAC TGC CCC ACC ATG ACC CGC GTG CTG CAG GGG GTC CTG CCG GCC 245 
Gly Tyr Cys Pro Thr Met Thr Arg Val Leu Gin Gly Val Leu Pro Ala 
60 65 70 

CTG CCT CAG GTG GTG TGC AAC TAC CGC GAT GTG CGC TTC GAG TCC ATC 2 93 

Leu Pro Gin Val Val Cys Asn Tyr Arg Asp Val Arg Phe Glu Ser lie 
75 80 85 

CGG CTC CCT GGC TGC CCG CGC GGC GTG AAC CCC GTG GTC TCC TAC GCC 341 
Arg Leu Pro Gly Cys Pro Arg Gly Val Asn Pro Val Val Ser Tyr Ala 
90 95 100 

GTG GCT CTC AGC TGT CAA TGT GCA CTC TGC CGC CGC AGC ACC ACT GAC 389 
Val Ala Leu Ser Cys Gin Cys Ala Leu Cys Arg Arg Ser Thr Thr Asp 



2 



9 



105 110 115 

TGC GGG GGT CCC AAG GAC CAC CCC TTG ACC TGT GAT GAC CCC CGC TTC 437 
Cys Gly Gly Pro Lys Asp His Pro Leu Thr Cys Asp Asp Pro Arg Phe 
120 125 130 135 

CAG GAC TCC TCT TCC TCA AAG GCC CCT CCC CCC AGC CTT CCA AGC CCA 4 85 

Gin Asp Ser Ser Ser Ser Lys Ala Pro Pro Pro Ser Leu Pro Ser Pro 
140 145 150 

TCC CGA CTC CCG GGG CCC TCG GAC ACC CCG ATC CTC CCC CAA GGA TCC 533 
Ser Arg Leu Pro Gly Pro Ser Asp Thr Pro lie Leu Pro Gin Gly Ser 
155 160 165 

GGT AGC GGA TCT GGT AGC GCT CCT GAT GTG CAG GAT TGC CCA GAA TGC 581 
Gly Ser Gly Ser Gly Ser Ala Pro Asp Val Gin Asp Cys Pro Glu Cys 
170 175 180 

ACG CTA CAG GAA AAC CCA TTC TTC TCC CAG CCG GGT GCC CCA ATA CTT 629 
Thr Leu Gin Glu Asn Pro Phe Phe Ser Gin Pro Gly Ala Pro lie Leu 
185 190 195 

CAG TGC ATG GGC TGC TGC TTC TCT AGA GCA TAT CCC ACT CCA CTA AGG 677 
Gin Cys Met Gly Cys Cys Phe Ser Arg Ala Tyr Pro Thr Pro Leu Arg 
200 205 210 215 

TCC AAG AAG ACG ATG TTG GTC CAA AAG AAC GTC ACC TCA GAG TCC ACT 725 
Ser Lys Lys Thr Met Leu Val Gin Lys Asn Val Thr Ser Glu Ser Thr 
220 225 230 

TGC TGT GTA GCT AAA TCA TAT AAC AGG GTC ACA GTA ATG GGG GGT TTC 773 
Cys Cys Val Ala Lys Ser Tyr Asn Arg Val Thr Val Met Gly Gly Phe 
235 240 245 

AAA GTG GAG AAC CAC ACG GCG TGC CAC TGC AGT ACT TGT TAT TAT CAC 821 
Lys Val Glu Asn His Thr Ala Cys His Cys Ser Thr Cys Tyr Tyr His 
250 255 260 

AAA TCT TAAGGTACC 836 
Lys Ser 
265 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 265 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Glu Met Phe Gin Gly Leu Leu Leu Leu Leu Leu Leu Ser Met Gly 



(o3 




1 








5 










10 










15 




Gly 


Thr 


Trp 


Ala 


Ser 


Lys 


Glu 


Pro 


Leu 


Arg 


Pro 


Arg 


Cys 


Arg 


Pro 


lie 








20 










25 










30 






Asn 


Ala 


Thr 


Leu 


Ala 


Val 


Glu 


Lys 


Glu 


Gly 


Cys 


Pro 


Val 


Cys 


He 


Thr 






35 










40 










45 








Val 


Asn 


Thr 


Thr 


He 


Cys 


Ala 


Gly Tyr 


Cys 


Pro 


Thr 


Met 


Thr 


Arg 


Val 




50 










55 










60 










Leu 


Gin 


Gly Val 


Leu 


Pro 


Ala 


Leu 


Pro 


Gin 


Val 


Val 


Cys 


Asn 


Tyr 


Arg 


65 










70 










75 










80 


Asp 


Val 


Arg 


Phe 


Glu 


Ser 


He 


Arg 


Leu 


Pro 


Gly Cys 


Pro 


Arg 


Gly 


Val 










85 










90 










95 




Asn 


Pro 


Val 


Val 


Ser 


Tyr 


Ala 


Val 


Ala 


Leu 


Ser 


Cys 


Gin 


Cys 


Ala 


Leu 








100 










105 










110 






Cys 


Arg Arg 


Ser 


Thr 


Thr 


Asp 


Cys 


Gly 


Gly 


Pro 


Lys 


Asp 


His 


Pro 


Leu 






115 










120 










125 








Thr 


Cys 


Asp 


Asp 


Pro 


Arg 


Phe 


Gin 


Asp 


Ser 


Ser 


Ser 


Ser 


Lys 


Ala 


Pro 




130 










135 










140 










Pro 


Pro 


Ser 


Leu 


Pro 


Ser 


Pro 


Ser 


Arg 


Leu 


Pro 


Gly 


Pro 


Ser 


Asp 


Thr 


145 










150 










155 










160 


Pro 


He 


Leu 


Pro 


Gin 


Gly 


Ser 


Gly 


Ser 


Gly 


Ser 


Gly 


Ser 


Ala 


Pro 


Asp 










165 










170 










175 




Val 


Gin 


As P 


Cys 


Pro 


Glu 


Cys 


Thr 


Leu 


Gin 


Glu 


Asn 


Pro 


Phe 


Phe 


Ser 








180 










185 










190 






Gin 


Pro 


Gly Ala 


Pro 


He 


Leu 


Gin 


Cys 


Met 


Gly Cys 


Cys 


Phe 


Ser 


Arg 






195 










200 










205 








Ala 


Tyr 


Pro 


Thr 


Pro 


Leu 


Arg 


Ser 


Lys 


Lys 


Thr 


Met 


Leu 


Val 


Gin 


Lys 




210 










215 










220 










Asn 


Val 


Thr 


Ser 


Glu 


Ser 


Thr 


Cys 


Cys 


Val 


Ala 


Lys 


Ser 


Tyr 


Asn 


Arg 


225 










230 










235 










240 


Val 


Thr 


Val 


Met 


Gly 


Gly 


Phe 


Lys 


Val 


Glu 


Asn 


His 


Thr 


Ala 


Cys 


His 










245 










250 










255 




Cys 


Ser 


Thr 


Cys 


Tyr 


Tyr 


His 


Lys 


Ser 

















260 265 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

TCCGGATTAG CTTGAGATGG ATCCGGTACC TTAAGATTTG TGATAATAAC AAGTACTGCA 60 

GTGGCACGCC GTGTGGTTCT CCACTTTGAA ACCCCCCATT ACTGTGACCC TGTTATATGA 120 

TTTAGCTACA CAGCAAGTGG ACTCTGAGGT GACGTTCTTT TGGACCAACA TCGTCTTCTT 180 

GGACCTTAGT GGAGTGGGAT ATGCTCTAGA GAAGCAGCAG CCCATGCACT GAAGTATTGG 240 

GGCACCCGGC TGGGAGAAGA ATGGGTTTTC CTGTAGCGTG CATTCTGGGC AATCCTGCAC 300 

ATCAGGAGCG CTACCAGATC CGCTACCGGA TCCTTGGGGG AGGATCGGGG TGTCCGAGGG 360 

CCCCGGGAGT CGGGATGGGC TTGGAAGGCT GGGGGGAGGG GCCTTTGAGG AAGAGGAGTC 420 

CTGGAAGCGG GGGTCATCAC AGGTCAAGGG GTGGTCCTTG GGACCCCCGC AGTCAGTGGT 4 80 

GCTGCGGCGG CAGAGTGCAC ATTGACAGCT GAGAGCCACG GCGTAGGAGA CCACGGGGTT 54 0 

CACGCCGCGC GGGCAGCCAG GGAGCCGGAT GGACTCGAAG CGCACATCGC GGTAGTTGCA 600 

CACCACCTGA GGCAGGGCCG GCAGGACCCC CTGCAGCACG CGGGTCATGG TGGGGCAGTA 660 

GCCGGCACAG ATGGTGGTGT TGACGGTGAT GCACACGGGG CAGCCCTCCT TCTCCACAGC 720 



CAGGGTGGGA TTGATGGGGC GGCACCGTGG CCGAAGCGGC TCCTTGGATG CCCATGTCCC 780 
GCCCATGCTC AGCAGCAGCA ACAGCAGCAG CCCCTGGAAC ATCTCCATCC TTGG 834 

(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 743 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 33... 734 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATGAAATCGA CGGAATCAGA CTCGAGCCAA GG ATG GAG ATG TTC CAG GGG CTG 53 

Met Glu Met Phe Gin Gly Leu 
1 5 

CTG CTG TTG CTG CTG CTG AGC ATG GGC GGG ACA TGG GCA TCC AAG GAG 101 
Leu Leu Leu Leu Leu Leu Ser Met Gly Gly Thr Trp Ala Ser Lys Glu 
10 15 20 

CCG CTT CGG CCA CGG TGC CGC CCC ATC AAT GCC ACC CTG GCT GTG GAG 14 9 

Pro Leu Arg Pro Arg Cys Arg Pro lie Asn Ala Thr Leu Ala Val Glu 
25 30 35 

AAG GAG GGC TGC CCC GTG TGC ATC ACC GTC AAC ACC ACC ATC TGT GCC 197 
Lys Glu Gly Cys Pro Val Cys lie Thr Val Asn Thr Thr lie Cys Ala 
40 45 50 55 

GGC TAC TGC CCC ACC ATG ACC CGC GTG CTG CAG GGG GTC CTG CCG GCC 245 
Gly Tyr Cys Pro Thr Met Thr Arg Val Leu Gin Gly Val Leu Pro Ala 
60 65 70 

CTG CCT CAG GTG GTG TGC AAC TAC CGC GAT GTG CGC TTC GAG TCC ATC 293 
Leu Pro Gin Val Val Cys Asn Tyr Arg Asp Val Arg Phe Glu Ser lie 
75 80 85 

CGG CTC CCT GGC TGC CCG CGC GGC GTG AAC CCC GTG GTC TCC TAC GCC 341 
Arg Leu Pro Gly Cys Pro Arg Gly Val Asn Pro Val Val Ser Tyr Ala 
90 95 100 

GTG GCT CTC AGC TGT CAA TGT GCA CTC TGC CGC CGC AGC ACC ACT GAC 38 9 

Val Ala Leu Ser Cys Gin Cys Ala Leu Cys Arg Arg Ser Thr Thr Asp 
105 110 115 

TGC GGG GGT CCC AAG GAC CAC CCC TTG ACC TGT GAT GAC CCG CGG GGA 437 
Cys Gly Gly Pro Lys Asp His Pro Leu Thr Cys Asp Asp Pro Arg Gly 
120 125 130 135 

TCC GGT AGC GGA TCT GGT AGC GCT CCT GAT GTG CAG GAT TGC CCA GAA 485 
Ser Gly Ser Gly Ser Gly Ser Ala Pro Asp Val Gin Asp Cys Pro Glu 



5 

Us 




130 135 140 



Asp 


Val 


Gin 


Asp 


Cys 


Pro 


Glu 


Cys 


Thr 


Leu 


Gin 


Glu 


Asn 


Pro 


Phe 


Phe 


145 










150 










155 










160 


Ser 


Gin 


Pro 


Gly 


Ala 


Pro 


He 


Leu 


Gin 


Cys 


Met 


Gly 


Cys 


Cvs 


Phe 


Ser 










165 










170 










175 




Arg 


Ala 


Tyr 


Pro 


Thr 


Pro 


Leu 


Arg 


Ser 


Lys 


Lys 


Thr 


Met 


Leu 


Val 


Gin 








180 










185 










190 






Lys 


Asn 


Val 


Thr 


Ser 


Glu 


Ser 


Thr 


Cys 


Cys 


Val 


Ala 


Lys 


Ser 


Tyr 


Asn 






195 










200 










205 








Arg 


Val 


Thr 


Val 


Met 


Gly 


Gly 


Phe 


Lys 


Val 


Glu 


Asn 


His 


Thr 


Ala 


Cys 




210 










215 










220 










His 


Cys 


Ser 


Thr 


Cys 


Tyr 


Tyr 


His 


Lys 


Ser 















225 230 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GGTACCTTAA GATTTGTGAT AATAACAAGT ACTGCAGTGG CACGCCGTGT GGTTCTCCAC 60 

TTTGAAACCC CCCATTACTG TGACCCTGTT ATATGATTTA GCTACACAGC AAGTGGACTC 120 

TGAGGTGACG TTCTTTTGGA CCAACATCGT CTTCTTGGAC CTTAGTGGAG TGGGATATGC 180 

TCTAGAGAAG CAGCAGCCCA TGCACTGAAG TATTGGGGCA CCCGGCTGGG AGAAGAATGG 24 0 

GTTTTCCTGT AGCGTGCATT CTGGGCAATC CTGCACATCA GGAGCGCTAC CAGATCCGCT 300 

ACCGGATCCC CGCGGGTCAT CACAGGTCAA GGGGTGGTCC TTGGGACCCC CGCAGTCAGT 360 

GGTGCTGCGG CGGCAGAGTG CACATTGACA GCTGAGAGCC ACGGCGTAGG AGACCACGGG 420 

GTTCACGCCG CGCGGGCAGC CAGGGAGCCG GATGGACTCG AAGCGCACAT CGCGGTAGTT 480 

GCACACCACC TGAGGCAGGG CCGGCAGGAC CCCCTGCAGC ACGCGGGTCA TGGTGGGGCA 540 

GTAGCCGGCA CAGATGGTGG TGTTGACGGT GATGCACACG GGGCAGCCCT CCTTCTCCAC 600 

AGCCAGGGTG GCATTGATGG GGCGGCACCG TGGCCGAAGC GGCTCCTTGG ATGCCCATGT 660 

CCCGCCCATG CTCAGCAGCA GCAACAGCAG CAGCCCCTGG AACATCTCCA TCCTTGG 717 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 744 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 34... 735 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

ATGAAATCGA CGGAATCAGA CTCGAGCCAA GGA ATG GAG ATG CTC CAG GGG CTG 54 

Met Glu Met Leu Gin Gly Leu 
1 5 



7 



140 145 150 

TGC ACG CTA CAG GAA AAC CCA TTC TTC TCC CAG CCG GGT GCC CCA ATA 533 
Cys Thr Leu Gin Glu Asn Pro Phe Phe Ser Gin Pro Gly Ala Pro lie 
155 160 165 

CTT CAG TGC ATG GGC TGC TGC TTC TCT AGA GCA TAT CCC ACT CCA CTA 581 
Leu Gin Cys Met Gly Cys Cys Phe Ser Arg Ala Tyr Pro Thr Pro Leu 
170 175 180 

AGG TCC AAG AAG ACG ATG TTG GTC CAA AAG AAC GTC ACC TCA GAG TCC 629 
Arg Ser Lys Lys Thr Met Leu Val Gin Lys Asn Val Thr Ser Glu Ser 
185 190 195 

ACT TGC TGT GTA GCT AAA TCA TAT AAC AGG GTC ACA GTA ATG GGG GGT 677 
Thr Cys Cys Val Ala Lys Ser Tyr Asn Arg Val Thr Val Met Gly Gly 
200 205 210 215 

TTC AAA GTG GAG AAC CAC ACG GCG TGC CAC TGC AGT ACT TGT TAT TAT 725 
Phe Lys Val Glu Asn His Thr Ala Cys His Cys Ser Thr Cys Tyr Tyr 
220 225 230 

CAC AAA TCT TAAGGTACC 743 
His Lys Ser 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 234 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



Met 


Glu 


Met 


Phe 


Gin 


Gly 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Ser 


Met 


Gly 


1 








5 










10 










15 




Gly 


Thr 


Trp 


Ala 


Ser 


Lys 


Glu 


Pro 


Leu 


Arg 


Pro 


Arg 


Cys 


Arg 


Pro 


He 








20 










25 










30 






Asn 


Ala 


Thr 


Leu 


Ala 


Val 


Glu 


Lys 


Glu 


Gly 


Cys 


Pro 


Val 


Cys 


He 


Thr 






35 










40 










45 








Val 


Asn 


Thr 


Thr 


He 


Cys 


Ala 


Gly 


Tyr 


Cys 


Pro 


Thr 


Met 


Thr 


Arg 


Val 




50 










55 










60 










Leu 


Gin 


Gly 


Val 


Leu 


Pro 


Ala 


Leu 


Pro 


Gin 


Val 


Val 


Cys 


Asn 


Tyr 


Arg 


65 










70 










75 










80 


Asp 


Val 


Arg 


Phe 


Glu 


Ser 


He 


Arg 


Leu 


Pro 


Gly Cys 


Pro 


Arg 


Gly 


Val 










85 










90 










95 




Asn 


Pro 


Val 


Val 


Ser 


Tyr 


Ala 


Val 


Ala 


Leu 


Ser 


Cys 


Gin 


Cys 


Ala 


Leu 








100 










105 










110 






Cys 


Arg 


Arg 


Ser 


Thr 


Thr 


Asp 


Cys 


Gly 


Gly 


Pro 


Lys 


Asp 


His 


Pro 


Leu 






115 










120 










125 








Thr 


Cys 


Asp 


Asp 


Pro 


Arg 


Gly 


Ser 


Gly 


Ser 


Gly 


Ser 


Gly 


Ser 


Ala 


Pro 



6 



CTG CTG TTG 
Leu Leu Leu 
10 



CTG CTG CTG AGC ATG GGC GGG GCA TGG GCA TCC AGG GAG 
Leu Leu Leu Ser Met Gly Gly Ala Trp Ala Ser Arg Glu 
15 20 



102 



CCG CTT CGG 
Pro Leu Arg 
25 



CCA TGG TGC 
Pro Trp Cys 



CAC CCC ATC AAT 
His Pro lie Asn 
30 



GCC ATC 
Ala He 
35 



CTG GCT 
Leu Ala 



GTG GAG 
Val Glu 



150 



AAG GAG GGC 
Lys Glu Gly 
40 



TGC CCC GTG 
Cys Pro Val 
45 



TGC ATC ACC GTC 
Cys He Thr Val 



AAC ACC 
Asn Thr 
50 



ACC ATC 
Thr He 



TGT GCC 
Cys Ala 
55 



198 



GGC TAC TGC 
Gly Tyr Cys 



CCC ACC ATG ATG CGC GTG CTG CAG GCG GTC CTG CCG CCC 
Pro Thr Met Met Arg Val Leu Gin Ala Val Leu Pro Pro 
60 65 70 



246 



CTG CCT CAG 
Leu Pro Gin 



GTG GTG TGC 
Val Val Cys 
75 



ACC TAC CGT GAT 
Thr Tyr Arg Asp 
80 



GTG CGC TTC GAG TCC ATC 
Val Arg Phe Glu Ser He 
85 



294 



CGG CTC CCT 
Arg Leu Pro 
90 



GGC TGC CCG 
Gly Cys Pro 



CGT GGC GTG GAC 
Arg Gly Val Asp 
95 



CCC GTG GTC TCC TTC CCT 
Pro Val Val Ser Phe Pro 
100 



342 



GTG GCT CTC 
Val Ala Leu 
105 



AGC TGT CGC 
Ser Cys Arg 



TGT GGA CCC TGC 
Cys Gly Pro Cys 
110 



CGC CGC AGC ACC 
Arg Arg Ser Thr 
115 



TCT GAC 
Ser Asp 



390 



TGT GGG GGT 
Cys Gly Gly 
120 



CCC AAA GAC 
Pro Lys Asp 
125 



CAC CCC TTG ACC 
His Pro Leu Thr 



TGT GAC CAC CCC 
Cys Asp His Pro 
130 



CAA GGA 
Gin Gly 
135 



438 



TCC GGT AGC 
Ser Gly Ser 



GGA TCT GGT 
Gly Ser Gly 
140 



AGC GCT 
Ser Ala 



CCT 
Pro 



GAT 
Asp 
145 



GTG CAG GAT TGC 
Val Gin Asp Cys 



CCA GAA 
Pro Glu 
150 



486 



TGC ACG CTA 
Cys Thr Leu 



CAG GAA AAC 
Gin Glu Asn 
155 



CCA TTC 
Pro Phe 



TTC 
Phe 
160 



TCC 
Ser 



CAG CCG 
Gin Pro 



GGT GCC 
Gly Ala 
165 



CCA ATA 
Pro He 



534 



CTT CAG TGC 
Leu Gin Cys 
170 



ATG GGC TGC 
Met Gly Cys 



TGC TTC 
Cys Phe 
175 



TCT 
Ser 



AGA 
Arg 



GCA TAT 
Ala Tyr 



CCC ACT 
Pro Thr 
180 



CCA CTA 
Pro Leu 



582 



AGG TCC AAG 
Arg Ser Lys 
185 



AAG ACG ATG 
Lys Thr Met 



TTG GTC 
Leu Val 
190 



CAA AAG 
Gin Lys 



AAC GTC 
Asn Val 
195 



ACC TCA 
Thr Ser 



GAG TCC 
Glu Ser 



630 



ACT TGC TGT 
Thr Cys Cys 
200 



GTA GCT AAA 
Val Ala Lys 
205 



TCA TAT 
Ser Tyr 



AAC AGG 
Asn Arg 



GTC ACA 
Val Thr 
210 



GTA ATG 
Val Met 



GGG GGT 
Gly Gly 
215 



678 



TTC AAA GTG 
Phe Lys Val 



GAG AAC CAC ACG GCG TGC CAC TGC AGT ACT TGT TAT TAT 
Glu Asn His Thr Ala Cys His Cys Ser Thr Cys Tyr Tyr 
220 225 230 



726 




CAC AAA TCT TAAGGTACC 74 4 

His Lys Ser 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 234 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



Met 


Glu 


Met 


Leu 


Gin 


Gly 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Ser 


Met 


Gly 


1 








5 










10 










15 




Gly 


Ala 


Trp 


Ala 


Ser 


Arg 


Glu 


Pro 


Leu 


Arg 


Pro 


Trp 


Cys 


His 


Pro 


He 








20 










25 










30 






Asn 


Ala 


He 


Leu 


Ala 


Val 


Glu 


Lys 


Glu 


Gly Cys 


Pro 


Val 


Cys 


He 


Thr 






35 










40 










45 








Val 


Asn 


Thr 


Thr 


He 


Cys 


Ala 


Gly 


Tyr 


Cys 


Pro 


Thr 


Met 


Met 


Arg 


Val 




50 










55 










60 










Leu 


Gin 


Ala 


Val 


Leu 


Pro 


Pro 


Leu 


Pro 


Gin 


Val 


Val 


Cys 


Thr 


Tyr 


Arg 


65 










70 










75 










80 


Asp 


Val 


Arg 


Phe 


Glu 


Ser 


He 


Arg 


Leu 


Pro 


Gly 


Cys 


Pro 


Arg 


Gly Val 










85 










90 










95 




Asp 


Pro 


Val 


Val 


Ser 


Phe 


Pro 


Val 


Ala 


Leu 


Ser 


Cys 


Arg 


Cys 


Gly 


Pro 








100 










105 










110 






Cys 


Arg 


Arg 


Ser 


Thr 


Ser 


Asp 


Cys 


Gly 


Gly 


Pro 


Lys 


Asp 


His 


Pro 


Leu 






115 










120 










125 








Thr 


Cys 


Asp 


His 


Pro 


Gin 


Gly 


Ser 


Gly 


Ser 


Gly 


Ser 


Gly Ser 


Ala 


Pro 




130 










135 










140 










Asp 


Val 


Gin 


Asp 


Cys 


Pro 


Glu 


Cys 


Thr 


Leu 


Gin 


Glu 


Asn 


Pro 


Phe 


Phe 


145 










150 










155 










160 


Ser 


Gin 


Pro 


Gly 


Ala 


Pro 


He 


Leu 


Gin 


Cys 


Met 


Gly 


Cys 


Cys 


Phe 


Ser 










165 










170 










175 




Arg 


Ala 


Tyr 


Pro 


Thr 


Pro 


Leu 


Arg 


Ser 


Lys 


Lys 


Thr 


Met 


Leu 


Val 


Gin 








180 










185 










190 






Lys 


Asn 


Val 


Thr 


Ser 


Glu 


Ser 


Thr 


Cys 


Cys 


Val 


Ala 


Lys 


Ser 


Tyr 


Asn 






195 










200 










205 








Arg 


Val 


Thr 


Val 


Met 


Gly 


Gly 


Phe 


Lys 


Val 


Glu 


Asn 


His 


Thr 


Ala 


Cys 




210 










215 










220 










His 


Cys 


Ser 


Thr 


Cys 


Tyr 


Tyr 


His 


Lys 


Ser 















225 230 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 718 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



' (ft 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GGTACCTTAA GATTTGTGAT AATAACAAGT ACTGCAGTGG CACGCCGTGT GGTTCTCCAC 60 

TTTGAAACCC CCCATTACTG TGACCCTGTT ATATGATTTA GCTACACAGC AAGTGGACTC 120 

TGAGGTGACG TTCTTTTGGA CCAACATCGT CTTCTTGGAC CTTAGTGGAG TGGGATATGC 180 

TCTAGAGAAG CAGCAGCCCA TGCACTGAAG TATTGGGGCA CCCGGCTGGG AGAAGAATGG 24 0 

GTTTTCCTGT AGCGTGCATT CTGGGCAATC CTGCACATCA GGAGCGCTAC CAGATCCGCT 300 

ACCGGATCCT TGGGGGTGGT CACAGGTCAA GGGGTGGTCT TTGGGACCCC CACAGTCAGA 360 

GGTGCTGCGG CGGCAGGGTC CACAGCGACA GCTGAGAGCC ACAGGGAAGG AGACCACGGG 420 

GTCCACGCCA CGCGGGCAGC CAGGGAGCCG GATGGACTCG AAGCGCACAT CACGGTAGGT 4 80 

GCACACCACC TGAGGCAGGG GCGGCAGGAC CGCCTGCAGC ACGCGCATCA TGGTGGGGCA 54 0 

GTAGCCGGCA CAGATGGTGG TGTTGACGGT GATGCACACG GGGCAGCCCT CCTTCTCCAC 600 

AGCCAGGATG GCATTGATGG GGTGGCACCA TGGCCGAAGC GGCTCCCTGG ATGCCCATGC 660 

CCCGCCCATG CTCAGCAGCA GCAACAGCAG CAGCCCCTGG AGCATCTCCA TTCCTTGG 718 



(2) INFORMATION FOR SEQ ID NO: 11: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 728 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ix) FEATURE: 



(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 33... 719 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



ATGAAATCGA CGGAATCAGA CTCGAGCCAA GG ATG AAG ACA CTC CAG TTT TTC 53 

Met Lys Thr Leu Gin Phe Phe 
1 5 

TTC CTT TTC TGT TGC TGG AAA GCA ATC TGC TGC AAT AGC TGT GAG CTG 101 
Phe Leu Phe Cys Cys Trp Lys Ala lie Cys Cys Asn Ser Cys Glu Leu 
10 15 20 

ACC AAC ATC ACC ATT GCA ATA GAG AAA GAA GAA TGT CGT TTC TGC ATA 14 9 

Thr Asn lie Thr lie Ala lie Glu Lys Glu Glu Cys Arg Phe Cys lie 
25 30 35 

AGC ATC AAC ACC ACT TGG TGT GCT GGC TAC TGC TAC ACC AGG GAT CTG 197 
Ser lie Asn Thr Thr Trp Cys Ala Gly Tyr Cys Tyr Thr Arg Asp Leu 
40 45 50 55 

GTG TAT AAG GAC CCA GCC AGG CCC AAA ATC CAG AAA ACA TGT ACC TTC 24 5 

Val Tyr Lys Asp Pro Ala Arg Pro Lys lie Gin Lys Thr Cys Thr Phe 
60 65 70 

AAG GAA CTG GTA TAT GAA ACA GTG AGA GTG CCC GGC TGT GCT CAC CAT 293 
Lys Glu Leu Val Tyr Glu Thr Val Arg Val Pro Gly Cys Ala His His 
75 80 85 

GCA GAT TCC TTG TAT ACA TAC CCA GTG GCC ACC CAG TGT CAC TGT GGC 341 
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ID 



Ala Asp Ser Leu Tyr Thr Tyr Pro Val Ala Thr Gin Cys His Cys Gly 
90 95 100 

AAG TGT GAC AGC GAC AGC ACT GAT TGT ACT GTG CGA GGC CTG GGG CCC 389 
Lys Cys Asp Ser Asp Ser Thr Asp Cys Thr Val Arg Gly Leu Gly Pro 
105 110 115 

AGC TAC TGC TCC TTT GGT GAA ATG AAA GAA GGA TCC GGT AGC GGA TCT 437 
Ser Tyr Cys Ser Phe Gly Glu Met Lys Glu Gly Ser Gly Ser Gly Ser 
120 125 130 135 

GGT AGC GCT CCT GAT GTG CAG GAT TGC CCA GAA TGC ACG CTA CAG GAA 485 
Gly Ser Ala Pro Asp Val Gin Asp Cys Pro Glu Cys Thr Leu Gin Glu 
140 145 150 

AAC CCA TTC TTC TCC CAG CCG GGT GCC CCA ATA CTT CAG TGC ATG GGC 533 
Asn Pro Phe Phe Ser Gin Pro Gly Ala Pro lie Leu Gin Cys Met Gly 
155 160 165 

TGC TGC TTC TCT AGA GCA TAT CCC ACT CCA CTA AGG TCC AAG AAG ACG 581 
Cys Cys Phe Ser Arg Ala Tyr Pro Thr Pro Leu Arg Ser Lys Lys Thr 
170 175 180 

ATG TTG GTC CAA AAG AAC GTC ACC TCA GAG TCC ACT TGC TGT GTA GCT 629 
Met Leu Val Gin Lys Asn Val Thr Ser Glu Ser Thr Cys Cys Val Ala 
185 190 195 

AAA TCA TAT AAC AGG GTC ACA GTA ATG GGG GGT TTC AAA GTG GAG AAC 67 7 

Lys Ser Tyr Asn Arg Val Thr Val Met Gly Gly Phe Lys Val Glu Asn 
200 205 210 215 

CAC ACG GCG TGC CAC TGC AGT ACT TGT TAT TAT CAC AAA TCT TAAGGTACC 728 
His Thr Ala Cys His Cys Ser Thr Cys Tyr Tyr His Lys Ser 
220 225 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 229 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Lys Thr Leu Gin Phe Phe Phe Leu Phe Cys Cys Trp Lys Ala lie 

15 10 15 

Cys Cys Asn Ser Cys Glu Leu Thr Asn lie Thr lie Ala lie Glu Lys 

20 25 30 

Glu Glu Cys Arg Phe Cys lie Ser lie Asn Thr Thr Trp Cys Ala Gly 

35 40 45 

Tyr Cys Tyr Thr Arg Asp Leu Val Tyr Lys Asp Pro Ala Arg Pro Lys 
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728 






50 










55 










60 










He 


Gin 


Lys 


Thr 


Cys 


Thr 


Phe 


Lys 


Glu 


Leu 


Val 


Tyr 


Glu 


Thr 


Val 


Arq 


65 










70 










75 










80 


Val 


Pro 


Gly 


Cys 


Ala 


His 


His 


Ala 


Asp 


Ser 


Leu 


Tyr 


Thr 


Tyr 


Pro 


Val 










85 










90 










95 




Ala 


Thr 


Gin 


Cys 


His 


Cys 


Gly 


Lys 


Cys 


Asp 


Ser 


Asp 


Ser 


Thr 


Asp 


Cys 








100 










105 










110 






Thr 


Val 


Arg 


Gly Leu 


Gly 


Pro 


Ser 


Tyr 


Cys 


Ser 


Phe 


Gly 


Glu 


Met 


Lys 






115 










120 










125 








Glu 


Gly 


Ser 


Gly 


Ser 


Gly 


Ser 


Gly 


Ser 


Ala 


Pro Asp Val 


Gin 


Asp 


Cys 




130 










135 










140 










Pro 


Glu 


Cys 


Thr 


Leu 


Gin 


Glu 


Asn 


Pro 


Phe 


Phe 


Ser 


Gin 


Pro 


Gly 


Ala 


145 










150 










155 










160 


Pro 


He 


Leu 


Gin 


Cys 


Met 


Gly 


Cys 


Cys 


Phe 


Ser 


Arg Ala 


Tyr 


Pro 


Thr 










165 










170 










175 




Pro 


Leu Arg 


Ser 


Lys 


Lys 


Thr 


Met 


Leu 


Val 


Gin 


Lys 


Asn 


Val 


Thr 


Ser 








180 










185 










190 






Glu 


Ser 


Thr 


Cys 


Cys 


Val 


Ala 


Lys 


Ser 


Tyr Asn Arg 


Val 


Thr 


Val 


Met 






195 










200 










205 








Gly 


Gly 


Phe 


Lys 


Val 


Glu 


Asn 


His 


Thr 


Ala 


Cys 


His 


Cys 


Ser 


Thr 


Cys 




210 










215 










220 










Tyr 


Tyr 


His 


Lys 


Ser 
























225 

































(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 702 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GGTACCTTAA GATTTGTGAT AATAACAAGT ACTGCAGTGG CACGCCGTGT GGTTCTCCAC 60 

TTTGAAACCC CCCATTACTG TGACCCTGTT ATATGATTTA GCTACACAGC AAGTGGACTC 120 

TGAGGTGACG TTCTTTTGGA CCAACATCGT CTTCTTGGAC CTTAGTGGAG TGGGATATGC 180 

TCTAGAGAAG CAGCAGCCCA TGCACTGAAG TATTGGGGCA CCCGGCTGGG AGAAGAATGG 240 

GTTTTCCTGT AGCGTGCATT CTGGGCAATC CTGCACATCA GGAGCGCTAC CAGATCCGCT 300 

ACCGGATCCT TCTTTCATTT CACCAAAGGA GCAGTAGCTG GGCCCCAGGC CTCGCACAGT 360 

ACAATCAGTG CTGTCGCTGT CACACTTGCC ACAGTGACAC TGGGTGGCCA CTGGGTATGT 420 

ATACAAGGAA TCTGCATGGT GAGCACAGCC GGGCACTCTC ACTGTTTCAT ATACCAGTTC 4 80 

CTTGAAGGTA CATGTTTTCT GGATTTTGGG CCTGGCTGGG TCCTTATACA CCAGATCCCT 54 0 

GGTGTAGCAG TAGCCAGCAC ACCAAGTGGT GTTGATGCTT ATGCAGAAAC GACATTCTTC 600 

TTTCTCTATT GCAATGGTGA TGTTGGTCAG CTCACAGCTA TTGCAGCAGA TTGCTTTCCA 660 

GCAACAGAAA AGGAAGAAAA ACTGGAGTGT CTTCATCCTT GG 702 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 752 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ix) FEATURE: 



12 




(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 33. . .743 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

ATGAAATCGA CGGAATCAGA CTCGAGCCAA GG ATG GAG ATG TTC CAG GGG CTG 53 

Met Glu Met Phe Gin Gly Leu 
1 5 

CTG CTG TTG CTG CTG CTG AGC ATG GGC GGG ACA TGG GCA TCC AAG GAG 101 
Leu Leu Leu Leu Leu Leu Ser Met Gly Gly Thr Trp Ala Ser Lys Glu 
10 15 20 

CCG CTT CGG CCA CGG TGC CGC CCC ATC AAT GCC ACC CTG GCT GTG GAG 14 9 

Pro Leu Arg Pro Arg Cys Arg Pro lie Asn Ala Thr Leu Ala Val Glu 
25 30 35 

AAG GAG GGC TGC CCC GTG TGC ATC ACC GTC AAC ACC ACC ATC TGT GCC 197 
Lys Glu Gly Cys Pro Val Cys lie Thr Val Asn Thr Thr lie Cys Ala 
40 45 50 55 

GGC TAG TGC CCC ACC ATG ACC CGC GTG CTG CAG GGG GTC CTG CCG GCC 24 5 

Gly Tyr Cys Pro Thr Met Thr Arg Val Leu Gin Gly Val Leu Pro Ala 
60 65 70 

CTG CCT CAG GTG GTG TGC AAC TAC CGC GAT GTG CGC TTC GAG TCC ATC 2 93 

Leu Pro Gin Val Val Cys Asn Tyr Arg Asp Val Arg Phe Glu Ser lie 
75 80 85 

CGG CTC CCT GGC TGC CCG CGC GGC GTG AAC CCC GTG GTC TCC TAC GCC 341 
Arg Leu Pro Gly Cys Pro Arg Gly Val Asn Pro Val Val Ser Tyr Ala 
90 95 100 

GTG GCT CTC AGC TGT CAA TGT GCA CTC TGC GAC AGC GAC AGC ACT GAT 389 
Val Ala Leu Ser Cys Gin Cys Ala Leu Cys Asp Ser Asp Ser Thr Asp 
105 110 115 

TGT ACT GTG CGA GGC CTG GGG CCC AGC TAC TGC TCC TTT GGT GAA ATG 437 
Cys Thr Val Arg Gly Leu Gly Pro Ser Tyr Cys Ser Phe Gly Glu Met 
120 125 130 135 

AAA GAA GGA TCC GGT AGC GGA TCT GGT AGC GCT CCT GAT GTG CAG GAT 485 
Lys Glu Gly Ser Gly Ser Gly Ser Gly Ser Ala Pro Asp Val Gin Asp 
140 145 150 

TGC CCA GAA TGC ACG CTA CAG GAA AAC CCA TTC TTC TCC CAG CCG GGT 533 
Cys Pro Glu Cys Thr Leu Gin Glu Asn Pro Phe Phe Ser Gin Pro Gly 
155 160 165 

GCC CCA ATA CTT CAG TGC ATG GGC TGC TGC TTC TCT AGA GCA TAT CCC 581 
Ala Pro lie Leu Gin Cys Met Gly Cys Cys Phe Ser Arg Ala Tyr Pro 
170 175 180 

ACT CCA CTA AGG TCC AAG AAG ACG ATG TTG GTC CAA AAG AAC GTC ACC 629 
Thr Pro Leu Arg Ser Lys Lys Thr Met Leu Val Gin Lys Asn Val Thr 
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185 190 195 

TCA GAG TCC ACT TGC TGT GTA GCT AAA TCA TAT AAC AGG GTC ACA GTA 677 
Ser Glu Ser Thr Cys Cys Val Ala Lys Ser Tyr Asn Arg Val Thr Val 
200 205 210 215 

ATG GGG GGT TTC AAA GTG GAG AAC CAC ACG GCG TGC CAC TGC AGT ACT 725 
Met Gly Gly Phe Lys Val Glu Asn His Thr Ala Cys His Cys Ser Thr 
220 225 230 

TGT TAT TAT CAC AAA TCT TAAGGTACC 752 
Cys Tyr Tyr His Lys Ser 
235 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



Met 


Glu 


Met 


Phe 


Gin 


Gly 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Ser 


Met 


Gly 


1 








5 










10 










15 




Gly 


Thr 


Trp 


Ala 


Ser 


Lys 


Glu 


Pro 


Leu 


Arg 


Pro 


Arg 


Cys 


Arg 


Pro 


He 








20 










25 










30 






Asn 


Ala 


Thr 


Leu 


Ala 


Val 


Glu 


Lys 


Glu 


Gly 


Cys 


Pro 


Val 


Cys 


He 


Thr 






35 










40 










45 








Val 


Asn 


Thr 


Thr 


He 


Cys 


Ala 


Gly 


Tyr 


Cys 


Pro 


Thr 


Met 


Thr 


Arg 


Val 




50 










55 










60 










Leu 


Gin 


Gly 


Val 


Leu 


Pro 


Ala 


Leu 


Pro 


Gin 


Val 


Val 


Cys 


Asn 


Tyr 


Arg 


65 










70 










75 










80 


Asp 


Val 


Arg 


Phe 


Glu 


Ser 


He 


Arg 


Leu 


Pro 


Gly Cys 


Pro 


Arg 


Gly 


Val 










85 










90 










95 




Asn 


Pro 


Val 


Val 


Ser 


Tyr 


Ala 


Val 


Ala 


Leu 


Ser 


Cys 


Gin 


Cys 


Ala 


Leu 








100 










105 










110 






Cys 


Asp 


Ser 


Asp 


Ser 


Thr 


Asp 


Cys 


Thr 


Val 


Arg 


Gly 


Leu 


Gly 


Pro 


Ser 






115 










120 










125 








Tyr 


Cys 


Ser 


Phe 


Gly 


Glu 


Met 


Lys 


Glu 


Gly 


Ser 


Gly 


Ser 


Gly 


Ser 


Gly 




130 










135 










140 










Ser 


Ala 


Pro 


Asp 


Val 


Gin 


Asp 


Cys 


Pro 


Glu 


Cys 


Thr 


Leu 


Gin 


Glu 


Asn 


145 










150 










155 










160 


Pro 


Phe 


Phe 


Ser 


Gin 


Pro 


Gly 


Ala 


Pro 


He 


Leu 


Gin 


Cys 


Met 


Gly 


Cys 










165 










170 










175 




Cys 


Phe 


Ser 


Arg 


Ala 


Tyr 


Pro 


Thr 


Pro 


Leu 


Arg 


Ser 


Lys 


Lys 


Thr 


Met 








180 










185 










190 






Leu 


Val 


Gin 


Lys 


Asn 


Val 


Thr 


Ser 


Glu 


Ser 


Thr 


Cys 


Cys 


Val 


Ala 


Lys 






195 










200 










205 








Ser 


Tyr 


Asn 


Arg 


Val 


Thr 


Val 


Met 


Gly 


Gly 


Phe 


Lys 


Val 


Glu 


Asn 


His 




210 










215 










220 










Thr 


Ala 


Cys 


His 


Cys 


Ser 


Thr 


Cys 


Tyr 


Tyr 


His 


Lys 


Ser 










225 230 235 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 726 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

GGTACCTTAA GATTTGTGAT AATAACAAGT ACTGCAGTGG CACGCCGTGT GGTTCTCCAC 60 

TTTGAAACCC CCCATTACTG TGACCCTGTT ATATGATTTA GCTACACAGC AAGTGGACTC 120 

TGAGGTGACG TTCTTTTGGA CCAACATCGT CTTCTTGGAC CTTAGTGGAG TGGGATATGC 180 

TCTAGAGAAG CAGCAGCCCA TGCACTGAAG TATTGGGGCA CCCGGCTGGG AGAAGAATGG 24 0 

GTTTTCCTGT AGCGTGCATT CTGGGCAATC CTGCACATCA GGAGCGCTAC CAGATCCGCT 300 

ACCGGATCCT' TCTTTCATTT CACCAAAGGA GCAGTAGCTG GGCCCCAGGC CTCGCACAGT 360 

ACAATCAGTG CTGTCGCTGT CGCAGAGTGC ACATTGACAG CTGAGAGCCA CGGCGTAGGA 4 20 

GACCACGGGG TTCACGCCGC GCGGGCAGCC AGGGAGCCGG ATGGACTCGA AGCGCACATC 480 

GCGGTAGTTG CACACCACCT GAGGCAGGGC CGGCAGGACC CCCTGCAGCA CGCGGGTCAT 54 0 

GGTGGGGCAG TAGCCGGCAC AGATGGTGGT GTTGACGGTG ATGCACACGG GGCAGCCCTC 600 

CTTCTCCACA GCCAGGGTGG CATTGATGGG GCGGCACCGT GGCCGAAGCG GCTCCTTGGA 660 

TGCCCATGTC CCGCCCATGC TCAGCAGCAG CAACAGCAGC AGCCCCTGGA ACATCTCCAT 720 

CCTTGG 72 6 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 752 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 33. . .743 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

ATGAAATCGA CGGAATCAGA CTCGAGCCAA GG ATG GAG ATG TTC CAG GGG CTG 53 

Met Glu Met Phe Gin Gly Leu 
1 5 

CTG CTG TTG CTG CTG CTG AGC ATG GGC GGG ACA TGG GCA TCC AAG GAG 101 
Leu Leu Leu Leu Leu Leu Ser Met Gly Gly Thr Trp Ala Ser Lys Glu 
10 15 20 

CCG CTT CGG CCA CGG TGC CGC CCC ATC AAT GCC ACC CTG GCT GTG GAG 14 9 

Pro Leu Arg Pro Arg Cys Arg Pro lie Asn Ala Thr Leu Ala Val Glu 
25 30 35 

AAG GAG GGC TGC CCC GTG TGC ATC ACC GTC AAC ACC ACC ATC TGT GCC 197 
Lys Glu Gly Cys Pro Val Cys lie Thr Val Asn Thr Thr lie Cys Ala 
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40 45 50 55 

GGC TAC TGC CCC ACC ATG ACC CGC GTG CTG CAG GGG GTC CTG CCG GCC 245 
Gly Tyr Cys Pro Thr Met Thr Arg Val Leu Gin Gly Val Leu Pro Ala 
60 65 70 

CTG CCT CAG GTG GTG TGC AAC TAC CGC GAT GTG CGC TTC GAG TCC ATC 2 93 

Leu Pro Gin Val Val Cys Asn Tyr Arg Asp Val Arg Phe Glu Ser lie 
75 80 85 

CGG CTC CCT GGC TGC CCG CGC GGC GTG AAC CCC GTG GTC TCC TAC GCC 341 
Arg Leu Pro Gly Cys Pro Arg Gly Val Asn Pro Val Val Ser Tyr Ala 
90 95 100 

GTG GCT CTC AGC TGT CAA TGT GCA CTC TGC CGC CGC AGC ACC ACT GAC 389 
Val Ala Leu Ser Cys Gin Cys Ala Leu Cys Arg Arg Ser Thr Thr Asp 
105 110 115 

TGC ACT GTG CGA GGC CTG GGG CCC AGC TAC TGC TCC TTT GGT GAA ATG 437 
Cys Thr Val Arg Gly Leu Gly Pro Ser Tyr Cys Ser Phe Gly Glu Met 
120 125 130 135 

AAA GAA GGA TCC GGT AGC GGA TCT GGT AGC GCT CCT GAT GTG CAG GAT 4 85 

Lys Glu Gly Ser Gly Ser Gly Ser Gly Ser Ala Pro Asp Val Gin Asp 
140 145 150 

TGC CCA GAA TGC ACG CTA CAG GAA AAC CCA TTC TTC TCC CAG CCG GGT 533 
Cys Pro Glu Cys Thr Leu Gin Glu Asn Pro Phe Phe Ser Gin Pro Gly 
155 160 165 

GCC CCA ATA CTT CAG TGC ATG GGC TGC TGC TTC TCT AGA GCA TAT CCC 581 
Ala Pro lie Leu Gin Cys Met Gly Cys Cys Phe Ser Arg Ala Tyr Pro 
170 175 180 

ACT CCA CTA AGG TCC AAG AAG ACG ATG TTG GTC CAA AAG AAC GTC ACC 62 9 

Thr Pro Leu Arg Ser Lys Lys Thr Met Leu Val Gin Lys Asn Val Thr 
185 190 195 

TCA GAG TCC ACT TGC TGT GTA GCT AAA TCA TAT AAC AGG GTC ACA GTA 67 7 

Ser Glu Ser Thr Cys Cys Val Ala Lys Ser Tyr Asn Arg Val Thr Val 
200 205 210 215 

ATG GGG GGT TTC AAA GTG GAG AAC CAC ACG GCG TGC CAC TGC AGT ACT 725 
Met Gly Gly Phe Lys Val Glu Asn His Thr Ala Cys His Cys Ser Thr 
220 225 230 

TGT TAT TAT CAC AAA TCT TAAGGTACC 7 52 

Cys Tyr Tyr His Lys Ser 
235 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



Met 


Glu 


Met 


Phe 


Gin 


Gly 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Ser 


Met 


Gly 


1 








5 










10 










15 




Gly 


Thr 


Trp 


Ala 


Ser 


Lys 


Glu 


Pro 


Leu 


Arg 


Pro 


Arg 


Cys 


Arg 


Pro 


lie 








20 










25 










30 






Asn 


Ala 


Thr 


Leu 


Ala 


Val 


Glu 


Lys 


Glu 


Gly Cys 


Pro 


Val 


Cys 


He 


Thr 






35 










40 










45 








Val 


Asn 


Thr 


Thr 


He 


Cys 


Ala 


Gly 
w j. jr 


Tyr 


Cvs 


Pro 


Thr 


Met 


Thr 


Arg 


Val 




50 










55 










60 










Leu 


Gin 


Gly 


Val 


Leu 


Pro 


Ala 


Leu 


Pro 


Gin 


Val 


Val 


Cys 


Asn 


Tvr 


Ara 


65 










70 










75 










80 


Asp 


Val 


Arg 


Phe 


Glu 


Ser 


He 


Ara 


Leu 


Pro 


Gly 


Cys 


Pro Arg 


Glv 


Val 










85 










90 










95 




Asn 


Pro 


Val 


Val 


Ser 


Tvr 


Ala 


Val 


Ala 


Leu 


Ser 


Cys 


Gin 


Cys 


Ala 


Leu 








100 










105 










110 






Cvs 


Arg 


Arg 


Ser 


Thr 


Thr 


Asp 


Cys 


Thr 


Val 


Arg 


Glv 


Leu Gly 


Pro 


Ser 






115 










120 










125 








Tyr 


Cys 


Ser 


Phe 


Gly 


Glu 


Met 


Lys 


Glu 


Gly 


Ser 


Gly 


Ser 


Gly 


Ser 


Gly 




130 










135 










140 










Ser 


Ala 


Pro 


Asp 


Val 


Gin 


Asp 


Cys 


Pro 


Glu 


Cys 


Thr 


Leu 


Gin 


Glu 


Asn 


145 










150 










155 










160 


Pro 


Phe 


Phe 


Ser 


Gin 


Pro 


Gly 


Ala 


Pro 


He 


Leu 


Gin 


Cys 


Met 


Gly 


Cys 










165 










170 










175 




Cys 


Phe 


Ser 


Arg 


Ala 


Tyr 


Pro 


Thr 


Pro 


Leu 


Arg 


Ser 


Lys 


Lys 


Thr 


Met 








180 










185 










190 






Leu 


Val 


Gin 


Lys 


Asn 


Val 


Thr 


Ser 


Glu 


Ser 


Thr 


Cys 


Cys 


Val 


Ala 


Lys 






195 










200 










205 








Ser 


Tyr 


Asn 


Arg 


Val 


Thr 


Val 


Met 


Gly 


Gly 


Phe 


Lys 


Val 


Glu 


Asn 


His 




210 










215 










220 










Thr 


Ala 


Cys 


His 


Cys 


Ser 


Thr 


Cys 


Tyr 


Tyr 


His 


Lys 


Ser 









225 230 235 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 726 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

GGTACCTTAA GATTTGTGAT AATAACAAGT ACTGCAGTGG CACGCCGTGT GGTTCTCCAC 60 

TTTGAAACCC CCCATTACTG TGACCCTGTT ATATGATTTA GCTACACAGC AAGTGGACTC 120 

TGAGGTGACG TTCTTTTGGA CCAACATCGT CTTCTTGGAC CTTAGTGGAG TGGGATATGC 180 

TCTAGAGAAG CAGCAGCCCA TGCACTGAAG TATTGGGGCA CCCGGCTGGG AGAAGAATGG 24 0 

GTTTTCCTGT AGCGTGCATT CTGGGCAATC CTGCACATCA GGAGCGCTAC CAGATCCGCT 300 

ACCGGATCCT TCTTTCATTT CACCAAAGGA GCAGTAGCTG GGCCCCAGGC CTCGCACAGT 360 

GCAGTCAGTG -GTGCTGCGGC GGCAGAGTGC ACATTGACAG CTGAGAGCCA CGGCGTAGGA 4 20 

GACCACGGGG TTCACGCCGC GCGGGCAGCC AGGGAGCCGG ATGGACTCGA AGCGCACATC 4 80 
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GCGGTAGTTG CACACCACCT GAGGCAGGGC CGGCAGGACC CCCTGCAGCA CGCGGGTCAT 54 0 

GGTGGGGCAG TAGCCGGCAC AGATGGTGGT GTTGACGGTG ATGCACACGG GGCAGCCCTC 600 

CTTCTCCACA GCCAGGGTGG CATTGATGGG GCGGCACCGT GGCCGAAGCG GCTCCTTGGA 660 

TGCCCATGTC CCGCCCATGC TCAGCAGCAG CAACAGCAGC AGCCCCTGGA ACATCTCCAT 720 

CCTTGG 72 6 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 743 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 33. . .734 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

ATGAAATCGA CGGAATCAGA CTCGAGCCAA GG ATG GAG ATG TTC CAG GGG CTG 53 

Met Glu Met Phe Gin Gly Leu 
1 5 

CTG CTG TTG CTG CTG CTG AGC ATG GGC GGG ACA TGG GCA TCC AAG GAG 101 
Leu Leu Leu Leu Leu Leu Ser Met Gly Gly Thr Trp Ala Ser Lys Glu 
10 15 20 

CCG CTT CGG CCA CGG TGC CGC CCC ATC AAT GCC ACC CTG GCT GTG GAG 14 9 

Pro Leu Arg Pro Arg Cys Arg Pro lie Asn Ala Thr Leu Ala Val Glu 
25 30 35 

AAG GAG GGC TGC CCC GTG TGC ATC ACC GTC AAC ACC ACC ATC TGT GCC 197 
Lys Glu Gly Cys Pro Val Cys lie Thr Val Asn Thr Thr lie Cys Ala 
40 45 50 55 

GGC TAC TGC CCC ACC ATG ACC CGC GTG CTG CAG GGG GTC CTG CCG GCC 245 
Gly Tyr Cys Pro Thr Met Thr Arg Val Leu Gin Gly Val Leu Pro Ala 
60 65 70 

CTG CCT CAG GTG GTG TGC AAC TAC CGC GAT GTG CGC TTC GAG TCC ATC 2 93 

Leu Pro Gin Val Val Cys Asn Tyr Arg Asp Val Arg Phe Glu Ser lie 
75 80 85 

CGG CTC CCT GGC TGC CCG CGC GGC GTG AAC CCC GTG GTC TCC TAC GCC 341 
Arg Leu Pro Gly Cys Pro Arg Gly Val Asn Pro Val Val Ser Tyr Ala 
90 95 100 

GTG GCT CTC AGC TGT CAA TGT GCA CTC TGC CGC CGC AGC ACC ACT GAC 389 
Val Ala Leu Ser Cys Gin Cys Ala Leu Cys Arg Arg Ser Thr Thr Asp 
105 110 115 

TGC ACT GTG CGA GGC CTG GGG CCC AGC TAC TGC TCC TTT GGT GAA GGA 4 37 

Cys Thr Val Arg Gly Leu Gly Pro Ser Tyr Cys Ser Phe Gly Glu Gly 
120 125 130 135 
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TCC GGT AGC GGA TCT GGT AGC GCT CCT GAT GTG CAG GAT TGC CCA GAA 4 85 

Ser Gly Ser Gly Ser Gly Ser Ala Pro Asp Val Gin Asp Cys Pro Glu 
140 145 150 

TGC ACG CTA CAG GAA AAC CCA TTC TTC TCC CAG CCG GGT GCC CCA ATA 533 
Cys Thr Leu Gin Glu Asn Pro Phe Phe Ser Gin Pro Gly Ala Pro lie 
155 160 165 

CTT CAG TGC ATG GGC TGC TGC TTC TCT AGA GCA TAT CCC ACT CCA CTA 581 
Leu Gin Cys Met Gly Cys Cys Phe Ser Arg Ala Tyr Pro Thr Pro Leu 
170 175 180 

AGG TCC AAG AAG ACG ATG TTG GTC CAA AAG AAC GTC ACC TCA GAG TCC 62 9 

Arg Ser Lys Lys Thr Met Leu Val Gin Lys Asn Val Thr Ser Glu Ser 
185 190 195 

ACT TGC TGT GTA GCT AAA TCA TAT AAC AGG GTC ACA GTA ATG GGG GGT 677 
Thr Cys Cys Val Ala Lys Ser Tyr Asn Arg Val Thr Val Met Gly Gly 
200 205 210 215 

TTC AAA GTG GAG AAC CAC ACG GCG TGC CAC TGC AGT ACT TGT TAT TAT 725 
Phe Lys Val Glu Asn His Thr Ala Cys His Cys Ser Thr Cys Tyr Tyr 
220 225 230 

CAC AAA TCT TAAGGTACC 743 
His Lys Ser 



(2) INFORMATION FOR SEQ ID NO:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 234 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



Met 


Glu 


Met 


Phe 


Gin 


Gly 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Ser 


Met 


Gly 


1 








5 










10 










15 




Gly 


Thr 


Trp 


Ala 
20 


Ser 


Lys 


Glu 


Pro 


Leu 
25 


Arg 


Pro 


Arg 


Cys 


Arg 
30 


Pro 


He 


Asn 


Ala 


Thr 
35 


Leu 


Ala 


Val 


Glu 


Lys 
40 


Glu 


Gly 


Cys 


Pro 


Val 
45 


Cys 


He 


Thr 


Val 


Asn 
50 


Thr 


Thr 


He 


Cys 


Ala 
55 


Gly 


Tyr 


Cys 


Pro 


Thr 
60 


Met 


Thr 


Arg 


Val 


Leu 


Gin 


Gly 


Val 


Leu 


Pro 


Ala 


Leu 


Pro 


Gin 


Val 


Val 


Cys 


Asn 


Tyr 


Arg 


65 










70 










75 










80 


Asp 


Val 


Arg 


Phe 


Glu 
85 


Ser 


He 


Arg 


Leu 


Pro 
90 


Gly 


Cys 


Pro 


Arg 


Gly 
95 


Val 


Asn 


Pro 


Val 


Val 


Ser 


Tyr 


Ala 


Val 


Ala 


Leu 


Ser 


Cys 


Gin 


Cys 


Ala 


Leu 



100 105 110 





Cys 


Arg 


Arg 


Ser 


Thr 


Thr 


Sen 
no 




Thr 


v a. j. 




ui y 


XjC u 


ox y 


n u 


OCX 






115 










120 










125 








Tvr 
i yx 


Cys 


Ser 


Phe 


Gly 


OX Li. 


c^ m 
y 


OCX 


m w 

uiy 


JCi 


uiy 


Cpv- 


ox y 


OCX 


jtxx d 


Py*n 
riu 




130 










135 










140 

x i w 










Asp 


Val 


Gin 


Asp 


Cys 


Pro 


O-LU 




Thr 


XiC LA 


Gin 


OX LI 


Don 


n u 


php 

IT I1C 


ph o 


145 










150 










155 










160 


Ser 


Gin 


Pro 


Gly 


Ala 


Prn 


T 1 ^ 

1JLC 




OJ.ll 






ox y 


vys 


uyo 


XT 11C 


OCX 










165 










170 










175 




Arg 


Ala 


Tvr 

x y x 


Pro 


Thr 


Pro 


Leu 


Arg 


Ser 


Lys 


Lys 


Thr 


Met 


XI C u 


Val 


Oil 1 








180 










185 










190 






Lys 


Asn 


Val 


Thr 


Ser 


Glu 


Ser 


Thr 


Cys 


Cys 


Val 


Ala 


Lys 


Ser 


Tyr 


Asn 






195 










200 










205 








Arg 


Val 


Thr 


Val 


Met 


Gly 


Gly 


Phe 


Lys 


Val 


Glu 


Asn 


His 


Thr 


Ala 


Cys 




210 










215 










220 










His 


Cys 


Ser 


Thr 


Cys 


Tyr 


Tyr 


His 


Lys 


Ser 















225 230 * 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
{ D) TOPOLOGY: linear 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GGTACCTTAA GATTTGTGAT AATAACAAGT ACTGCAGTGG CACGCCGTGT GGTTCTCCAC 60 

TTTGAAACCC CCCATTACTG TGACCCTGTT ATATGATTTA GCTACACAGC AAGTGGACTC 120 

TGAGGTGACG TTCTTTTGGA CCAACATCGT CTTCTTGGAC CTTAGTGGAG TGGGATATGC 180 

TCTAGAGAAG CAGCAGCCCA TGCACTGAAG TATTGGGGCA CCCGGCTGGG AGAAGAATGG 24 0 

GTTTTCCTGT AGCGTGCATT CTGGGCAATC CTGCACATCA GGAGCGCTAC CAGATCCGCT 300 

ACCGGATCCT TCACCAAAGG AGCAGTAGCT GGGCCCCAGG CCTCGCACAG TGCAGTCAGT 360 

GGTGCTGCGG CGGCAGAGTG CACATTGACA GCTGAGAGCC ACGGCGTAGG AGACCACGGG 420 

GTTCACGCCG CGCGGGCAGC CAGGGAGCCG GATGGACTCG AAGCGCACAT CGCGGTAGTT 4 80 

GCACACCACC TGAGGCAGGG CCGGCAGGAC CCCCTGCAGC ACGCGGGTCA TGGTGGGGCA 54 0 

GTAGCCGGCA CAGATGGTGG TGTTGACGGT GATGCACACG GGGCAGCCCT CCTTCTCCAC 600 

AGCCAGGGTG GCATTGATGG GGCGGCACCG TGGCCGAAGC GGCTCCTTGG ATGCCCATGT 660 

CCCGCCCATG CTCAGCAGCA GCAACAGCAG CAGCCCCTGG AACATCTCCA TCCTTGG 717 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 743 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

<ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 33... 734 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 




ATGAAATCGA CGGAATCAGA CTCGAGCCAA GG 



ATG GAG ATG 
Met Glu Met 
1 



TTC CAG GGG CTG 
Phe Gin Gly Leu 
5 



53 



CTG CTG 
Leu Leu 



TTG CTG CTG 
Leu Leu Leu 
10 



CTG AGC 
Leu Ser 



ATG GGC 
Met Gly 
15 



GGG ACA TGG 
Gly Thr Trp 



GCA TCC AAG GAG 
Ala Ser Lys Glu 
20 



101 



CCG CTT 
Pro Leu 
25 



CGG CCA CGG 
Arg Pro Arg 



TGC CGC 
Cys Arg 
30 



CCC ATC 
Pro lie 



AAT 
Asn 



GCC ACC 
Ala Thr 
35 



CTG GCT GTG GAG 
Leu Ala Val Glu 



149 



AAG GAG 
Lys Glu 
40 



GGC TGC CCC 
Gly Cys Pro 



GTG TGC 
Val Cys 
45 



ATC ACC 
He Thr 



GTC 
Val 



AAC ACC 
Asn Thr 
50 



ACC 
Thr 



ATC 
He 



TGT GCC 
Cys Ala 
55 



197 



GGC TAC 
Gly Tyr 



TGC CCC ACC 
Cys Pro Thr 
60 



ATG ACC 
Met Thr 



CGC GTG 
Arg Val 



CTG 
Leu 
65 



CAG GGG 
Gin Gly 



GTC 
Val 



CTG 
Leu 



CCG GCC 
Pro Ala 
70 



245 



CTG CCT 
Leu Pro 



CAG GTG GTG 
Gin Val Val 
75 



TGC AAC 
Cys Asn 



TAC CGC 
Tyr Arg 
80 



GAT 
Asp 



GTG CGC 
Val Arg 



TTC 
Phe 



GAG 
Glu 
85 



TCC ATC 
Ser He 



293 



CGG CTC 
Arg Leu 



CCT GGC TGC 
Pro Gly Cys 
90 



CCG CGC 
Pro Arg 



GGC GTG 
Gly Val 
95 



AAC CCC GTG 
Asn Pro Val 



GTC 
Val 
100 



TCC 
Ser 



TAC GCC 
Tyr Ala 



341 



GTG GCT 
Val Ala 
105 



CTC AGC TGT 
Leu Ser Cys 



CAA TGT 
Gin Cys 
110 



GCA CTC 
Ala Leu 



TGC CGC CGC 
Cys Arg Arg 
115 



AGC 
Ser 



ACC ACT GAC 
Thr Thr Asp 



389 



TGC ACT 
Cys Thr 
120 



GTG CGA GGC 
Val Arg Gly 



CTG GGG 
Leu Gly 
125 



CCC AGC 
Pro Ser 



TAC TGC GAT 
Tyr Cys Asp 
130 



GAC CCG 
Asp Pro 



CGG GGA 
Arg Gly 
135 



437 



TCC GGT 
Ser Gly 



AGC GGA TCT 
Ser Gly Ser 
140 



GGT AGC 
Gly Ser 



GCT CCT 
Ala Pro 



GAT GTG CAG 
Asp Val Gin 
145 



GAT TGC 
Asp Cys 



CCA GAA 
Pro Glu 
150 



485 



TGC ACG 
Cys Thr 



CTA CAG GAA 
Leu Gin Glu 
155 



AAC CCA 
Asn Pro 



TTC TTC 
Phe Phe 
160 



TCC CAG CCG 
Ser Gin Pro 



GGT GCC 
Gly Ala 
165 



CCA ATA 
Pro He 



533 



CTT CAG 
Leu Gin 



TGC ATG GGC 
Cys Met Gly 
170 



TGC TGC 
Cys Cys 



TTC TCT 
Phe Ser 
175 



AGA GCA TAT 
Arg Ala Tyr 



CCC ACT 
Pro Thr 
180 



CCA CTA 
Pro Leu 



581 



AGG TCC 
Arg Ser 
185 



AAG AAG ACG 
Lys Lys Thr 



ATG TTG 
Met Leu 
190 



GTC CAA 
Val Gin 



AAG AAC GTC 
Lys Asn Val 
195 



ACC TCA 
Thr Ser 



GAG TCC 
Glu Ser 



629 



ACT TGC 
Thr Cys 
200 



TGT GTA GCT 
Cys Val Ala 



AAA TCA 
Lys Ser 
205 



TAT AAC 
Tyr Asn 



AGG GTC ACA 
Arg Val Thr 
210 



GTA ATG GGG GGT 
Val Met Gly Gly 
215 



677 



TTC AAA GTG GAG AAC CAC ACG GCG TGC CAC TGC AGT ACT TGT TAT TAT 



725 
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Phe Lys Val Glu Asn His Thr Ala Cys His Cys Ser Thr Cys Tyr Tyr 
220 225 230 

CAC AAA TCT TAAGGTACC 743 
His Lys Ser 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 234 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 



Met 


Glu 


Met 


Phe 


Gin 


Gly 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Ser 


Met 


Gly 


1 








5 










10 










15 




Gly 


Thr 


Trp 


Ala 


Ser 


Lys 


Glu 


Pro 


Leu 


Arg 


Pro 


Arg 


Cys 


Arg 


Pro 


He 








20 










25 










30 






Asn 


Ala 


Thr 


Leu 


Ala 


Val 


Glu 


Lys 


Glu 


Gly 


Cys 


Pro 


Val 


Cys 


He 


Thr 






35 










40 










45 








Val 


Asn 


Thr 


Thr 


He 


Cys 


Ala 


Gly 


Tyr 


Cys 


Pro 


Thr 


Met 


Thr 


Arg 


Val 




50 










55 










60 










Leu 


Gin 


Gly 


Val 


Leu 


Pro 


Ala 


Leu 


Pro 


Gin 


Val 


Val 


Cys 


Asn 


Tyr 


Arg 


65 










70 










75 










80 


Asp 


Val 


Arg 


Phe 


Glu 


Ser 


He 


Arg 


Leu 


Pro 


Gly 


Cys 


Pro 


Arg 


Gly 


Val 










85 










90 










95 




Asn 


Pro 


Val 


Val 


Ser 


Tyr 


Ala 


Val 


Ala 


Leu 


Ser 


Cys 


Gin 


Cys 


Ala 


Leu 








100 










105 










110 






Cys 


Arg 


Arg 


Ser 


Thr 


Thr 


Asp 


Cys 


Thr 


Val 


Arg 


Gly 


Leu 


Gly 


Pro 


Ser 






115 










120 










125 








Tyr 


Cys 


Asp 


Asp 


Pro 


Arg 


Gly 


Ser 


Gly 


Ser 


Gly 


Ser 


Gly 


Ser 


Ala 


Pro 




130 










135 










140 










Asp 


Val 


Gin 


Asp 


Cys 


Pro 


Glu 


Cys 


Thr 


Leu 


Gin 


Glu 


Asn 


Pro 


Phe 


Phe 


145 










150 










155 










160 


Ser 


Gin 


Pro 


Gly 


Ala 


Pro 


He 


Leu 


Gin 


Cys 


Met 


Gly Cys 


Cys 


Phe 


Ser 










165 










170 










175 




Arg 


Ala 


Tyr 


Pro 


Thr 


Pro 


Leu 


Arg 


Ser 


Lys 


Lys 


Thr 


Met 


Leu 


Val 


Gin 








180 










185 










190 






Lys 


Asn 


Val 


Thr 


Ser 


Glu 


Ser 


Thr 


Cys 


Cys 


Val 


Ala 


Lys 


Ser 


Tyr 


Asn 






195 










200 










205 








Arg 


Val 


Thr 


Val 


Met 


Gly 


Gly 


Phe 


Lys 


Val 


Glu 


Asn 


His 


Thr 


Ala 


Cys 




210 










215 










220 










His 


Cys 


Ser 


Thr 


Cys 


Tyr 


Tyr 


His 


Lys 


Ser 















225 230 



(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

GGTACCTTAA GATTTGTGAT AATAACAAGT ACTGCAGTGG CACGCCGTGT GGTTCTCCAC 60 

TTTGAAACCC CCCATTACTG TGACCCTGTT ATATGATTTA GCTACACAGC AAGTGGACTC 120 

TGAGGTGACG TTCTTTTGGA CCAACATCGT CTTCTTGGAC CTTAGTGGAG TGGGATATGC 180 

TCTAGAGAAG CAGCAGCCCA TGCACTGAAG TATTGGGGCA CCCGGCTGGG AGAAGAATGG 240 

GTTTTCCTGT AGCGTGCATT CTGGGCAATC CTGCACATCA GGAGCGCTAC CAGATCCGCT 300 

ACCGGATCCC CGCGGGTCAT CGCAGTAGCT GGGCCCCAGG CCTCGCACAG TGCAGTCAGT 360 

GGTGCTGCGG CGGCAGAGTG CACATTGACA GCTGAGAGCC ACGGCGTAGG AGACC AC GGG 4 20 

GTTCACGCCG CGCGGGCAGC CAGGGAGCCG GATGGACTCG AAGCGCACAT CGCGGTAGTT 4 80 

GCACACCACC TGAGGCAGGG CCGGCAGGAC CCCCTGCAGC ACGCGGGTCA TGGTGGGGCA 54 0 

GTAGCCGGCA CAGATGGTGG TGTTGACGGT GATGCACACG GGGCAGCCCT CCTTCTCCAC 600 

AGCCAGGGTG GCATTGATGG GGCGGCACCG TGGCCGAAGC GGCTCCTTGG ATGCCCATGT 660 

CCCGCCCATG CTCAGCAGCA GCAACAGCAG CAGCCCCTGG AACATCTCCA TCCTTGG 717 

(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 719 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 33... 700 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

ATGAAATCGA CGGAATCAGA CTCGAGCCAA GG ATG AAG ACA CTC CAG TTT TTC 53 

Met Lys Thr Leu Gin Phe Phe 
1 5 

TTC CTT TTC TGT TGC TGG AAA GCA ATC TGC TGC AAT AGC TGT GAG CTG 101 
Phe Leu Phe Cys Cys Trp Lys Ala lie Cys Cys Asn Ser Cys Glu Leu 
10 15 20 

ACC AAC ATC ACC ATT GCA ATA GAG AAA GAA GAA TGT CGT TTC TGC ATA 14 9 

Thr Asn lie Thr lie Ala lie Glu Lys Glu Glu Cys Arg Phe Cys lie 
25 30 35 

AGC ATC AAC ACC ACT TGG TGT GCT GGC TAC TGC TAC ACC AGG GAT CTG 197 
Ser lie Asn Thr Thr Trp Cys Ala Gly Tyr Cys Tyr Thr Arg Asp Leu 
40 45 50 55 

GTG TAT AAG GAC CCA GCC AGG CCC AAA ATC CAG AAA ACA TGT ACC TTC 24 5 

Val Tyr Lys Asp Pro Ala Arg Pro Lys lie Gin Lys Thr Cys Thr Phe 
60 65 70 

AAG GAA CTG GTA TAT GAA ACA GTG AGA GTG CCC GGC TGT GCT CAC CAT 293 
Lys Glu Leu Val Tyr Glu Thr Val Arg Val Pro Gly Cys Ala His His 
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75 80 85 

GCA GAT TCC TTG TAT ACA TAC CCA GTG GCC ACC CAG TGT CAC TGT GGC 341 
Ala Asp Ser Leu Tyr Thr Tyr Pro Val Ala Thr Gin Cys His Cys Gly 
90 95 100 

AAG TGT GAC AGC GAC AGC ACT GAT TGT ACT GTG CGA GGC CTG GGG CCC 389 
Lys Cys Asp Ser Asp Ser Thr Asp Cys Thr Val Arg Gly Leu Gly Pro 
105 110 115 

AGC TAC TGC TCC TTT GGT GAA GGA TCC GGT AGC GGA TCT GGT AGC GCT 4 37 

Ser Tyr Cys Ser Phe Gly Glu Gly Ser Gly Ser Gly Ser Gly Ser Ala 
120 125 130 135 

CCT GAT GTG CAG GAT TGC CCA GAA TGC ACG CTA CAG GAA AAC CCA TTC 4 85 

Pro Asp Val Gin Asp Cys Pro Glu Cys Thr Leu Gin Glu Asn Pro Phe 
140 145 150 

TTC TCC CAG CCG GGT GCC CCA ATA CTT CAG TGC ATG GGC TGC TGC TTC 533 
Phe Ser Gin Pro Gly Ala Pro lie Leu Gin Cys Met Gly Cys Cys Phe 
155 160 165 

TCT AGA GCA TAT CCC ACT CCA CTA AGG TCC AAG AAG ACG ATG TTG GTC 581 
Ser Arg Ala Tyr Pro Thr Pro Leu Arg Ser Lys Lys Thr Met Leu Val 
170 175 180 

CAA AAG AAC GTC ACC TCA GAG TCC ACT TGC TGT GTA GCT AAA TCA TAT 629 
Gin Lys Asn Val Thr Ser Glu Ser Thr Cys Cys Val Ala Lys Ser Tyr 
185 190 195 

AAC AGG GTC ACA GTA ATG GGG GGT TTC AAA GTG GAG AAC CAC ACG GCG 677 
Asn Arg Val Thr Val Met Gly Gly Phe Lys Val Glu Asn His Thr Ala 
200 205 210 215 

TGC CAC TGC AGT ACT TGT TAT TA TCACAAATCT TAAGGTACC 719 
Cys His Cys Ser Thr Cys Tyr Tyr 
220 



(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 223 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

He 
Lys 
Gly 
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Met Lys Thr Leu Gin Phe Phe 

1 5 
Cys Cys Asn Ser Cys Glu Leu 
20 

Glu Glu Cys Arg Phe Cys He 



Phe Leu Phe Cys Cys Trp Lys Ala 

10 15 
Thr Asn He Thr He Ala He Glu 

25 30 
Ser He Asn Thr Thr Trp Cys Ala 



35 40 45 



Tyr 


Cys 
50 


Tyr 


Thr 


Arg 


Asp 


Leu 
55 


Val 


Tyr 


Lys 


Asp 


Pro 
60 


Ala 


Arg 


Pro 


Lys 


He 


Gin 


Lys 


Thr 


Cys 


Thr 


Phe 


Lys 


Glu 


Leu 


Val 


Tyr 


Glu 


Thr 


Val 


Arg 


65 










70 










75 










80 


Val 


Pro 


Gly 


Cys 


Ala 
85 


His 


His 


Ala 


Asp 


Ser 
90 


Leu 


Tyr 


Thr 


Tyr 


Pro 
95 


Val 


Ala 


Thr 


Gin 


Cys 
100 


His 


Cys 


Gly 


Lys 


Cys 
105 


Asp 


Ser 


Asp 


Ser 


Thr 
110 


Asp 


Cys 


Thr 


Val 


Arg 
115 


Gly 


Leu 


Gly 


Pro 


Ser 
120 


Tyr 


Cys 


Ser 


Phe 


Gly 
125 


Glu 


Gly 


Ser 


Gly 


Ser 
130 


Gly 


Ser 


Gly 


Ser 


Ala 
135 


Pro 


Asp 


Val 


Gin 


Asp 
140 


Cys 


Pro 


Glu 


Cys 


Thr 


Leu 


Gin 


Glu 


Asn 


Pro 


Phe 


Phe 


Ser 


Gin 


Pro 


Gly Ala 


Pro 


He 


Leu 


145 










150 










155 










160 


Gin 


Cys 


Met 


Gly Cys 


Cys 


Phe 


Ser 


Arg 


Ala 


Tyr 


Pro 


Thr 


Pro 


Leu 


Arg 










165 










170 










175 




Ser 


Lys 


Lys 


Thr 
180 


Met 


Leu 


Val 


Gin 


Lys 
185 


Asn 


Val 


Thr 


Ser 


Glu 
190 


Ser 


Thr 


Cys 


Cys 


Val 
195 


Ala 


Lys 


Ser 


Tyr 


Asn 
200 


Arg 


Val 


Thr 


Val 


Met 
205 


Gly 


Gly 


Phe 


Lys 


Val 
210 


Glu 


Asn 


His 


Thr 


Ala 
215 


Cys 


His 


Cys 


Ser 


Thr 
220 


Cys 


Tyr 


Tyr 





(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 693 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

GGTACCTTAA GATTTGTGAT AATAACAAGT ACTGCAGTGG CACGCCGTGT GGTTCTCCAC 60 

TTTGAAACCC CCCATTACTG TGACCCTGTT ATATGATTTA GCTACACAGC AAGTGGACTC 120 

TGAGGTGACG TTCTTTTGGA CCAACATCGT CTTCTTGGAC CTTAGTGGAG TGGGATATGC 18 0 

TCTAGAGAAG CAGCAGCCCA TGCACTGAAG TATTGGGGCA CCCGGCTGGG AGAAGAATGG 24 0 

GTTTTCCTGT AGCGTGCATT CTGGGCAATC CTGCACATCA GGAGCGCTAC CAGATCCGCT 300 

ACCGGATCCT TCACCAAAGG AGCAGTAGCT GGGCCCCAGG CCTCGCACAG TACAATCAGT 360 

GCTGTCGCTG TCACACTTGC CACAGTGACA CTGGGTGGCC ACTGGGTATG TATACAAGGA 420 

ATCTGCATGG TGAGCACAGC CGGGCACTCT CACTGTTTCA TATACCAGTT CCTTGAAGGT 4 80 

ACATGTTTTC TGGATTTTGG GCCTGGCTGG' GTCCTTATAC ACCAGATCCC TGGTGTAGCA 54 0 

GTAGCCAGCA CACCAAGTGG TGTTGATGCT TATGCAGAAA CGACATTCTT CTTTCTCTAT 600 

TGCAATGGTG ATGTTGGTCA GCTCACAGCT ATTGCAGCAG ATTGCTTTCC AGCAACAGAA 660 

AAGGAAGAAA AACTGGAGTG TCTTCATCCT TGG 693 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 707 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ix) FEATURE: 



25 



1 



(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 33... 698 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

ATGAAATCGA CGGAATCAGA CTCGAGCCAA GG ATG AAG ACA CTC CAG TTT TTC 53 

Met Lys Thr Leu Gin Phe Phe 
1 5 

TTC CTT TTC TGT TGC TGG AAA GCA ATC TGC TGC AAT AGC TGT GAG CTG 101 
Phe Leu Phe Cys Cys Trp Lys Ala lie Cys Cys Asn Ser Cys Glu Leu 
10 15 20 

ACC AAC ATC ACC ATT GCA ATA GAG AAA GAA GAA TGT CGT TTC TGC ATA 14 9 

Thr Asn lie Thr lie Ala lie Glu Lys Glu Glu Cys Arg Phe Cys lie 
25 30 35 

AGC ATC AAC ACC ACT TGG TGT GCT GGC TAC TGC TAC ACC AGG GAT CTG 197 
Ser lie Asn Thr Thr Trp Cys Ala Gly Tyr Cys Tyr Thr Arg Asp Leu 
40 45 50 55 

GTG TAT AAG GAC CCA GCC AGG CCC AAA ATC CAG AAA ACA TGT ACC TTC 245 
Val Tyr Lys Asp Pro Ala Arg Pro Lys lie Gin Lys Thr Cys Thr Phe 
60 65 70 

AAG GAA CTG GTA TAT GAA ACA GTG AGA GTG CCC GGC TGT GCT CAC CAT 2 93 

Lys Glu Leu Val Tyr Glu Thr Val Arg Val Pro Gly Cys Ala His His 
75 80 85 

GCA GAT TCC TTG TAT ACA TAC CCA GTG GCC ACC CAG TGT CAC TGT GGC 341 
Ala Asp Ser Leu Tyr Thr Tyr Pro Val Ala Thr Gin Cys His Cys Gly 
90 95 100 

AAG TGT GAC AGC GAC AGC ACT GAT TGT ACT GTG CGA GGC CTG GGG CCC 38 9 

Lys Cys Asp Ser Asp Ser Thr Asp Cys Thr Val Arg Gly Leu Gly Pro 
105 110 115 

AGC TAC TGC GGA TCC GGT AGC GGA TCT GGT AGC GCT CCT GAT GTG CAG 437 
Ser Tyr Cys Gly Ser Gly Ser Gly Ser Gly Ser Ala Pro Asp Val Gin 
120 125 130 135 

GAT TGC CCA GAA TGC ACG CTA CAG GAA AAC CCA TTC TTC TCC CAG CCG 4 85 

Asp Cys Pro Glu Cys Thr Leu Gin Glu Asn Pro Phe Phe Ser Gin Pro 
140 145 150 

GGT GCC CCA ATA CTT CAG TGC ATG GGC TGC TGC TTC TCT AGA GCA TAT 533 
Gly Ala Pro lie Leu Gin Cys Met Gly Cys Cys Phe Ser Arg Ala Tyr 
155 160 165 

CCC ACT CCA CTA AGG TCC AAG AAG ACG ATG TTG GTC CAA AAG AAC GTC 581 
Pro Thr Pro Leu Arg Ser Lys Lys Thr Met Leu Val Gin Lys Asn Val 
170 175 180 

ACC TCA GAG TCC ACT TGC TGT GTA GCT AAA TCA TAT AAC AGG GTC ACA 629 
Thr Ser Glu Ser Thr Cys Cys Val Ala Lys Ser Tyr Asn Arg Val Thr 
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185 190 195 

GTA ATG GGG GGT TTC AAA GTG GAG AAC CAC ACG GCG TGC CAC TGC AGT 677 
Val Met Gly Gly Phe Lys Val Glu Asn His Thr Ala Cys His Cys Ser 
200 205 210 215 

ACT TGT TAT TAT CAC AAA TCT TAAGGTACC 707 
Thr Cys Tyr Tyr His Lys Ser 
220 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 222 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



Met 


Lys 


Thr 


Leu 


Gin 


Phe 


Phe 


Phe 


Leu 


Phe 


Cys 


Cys 


Trp 


Lys 


Ala 


He 


1 








5 










10 










15 




Cys 


Cys 


Asn 


Ser 
20 


Cys 


Glu 


Leu 


Thr 


Asn 
25 


He 


Thr 


He 


Ala 


He 
30 


Glu 


Lys 


Glu 


Glu 


Cys 
35 


Arg 


Phe 


Cys 


He 


Ser 
40 


He 


Asn 


Thr 


Thr 


Trp 
45 


Cys 


Ala 


Gly 


Tyr 


Cys 
50 


Tyr 


Thr 


Arg 


Asp 


Leu 
55 


Val 


Tyr 


Lys 


Asp 


Pro 
60 


Ala 


Arg 


Pro 


Lys 


He 


Gin 


Lys 


Thr 


Cys 


Thr 


Phe 


Lys 


Glu 


Leu 


Val 


Tyr 


Glu 


Thr 


Val 


Arg 


65 










70 










75 










80 


Val 


Pro 


Gly 


Cys 


Ala 
85 


His 


His 


Ala 


Asp 


Ser 
90 


Leu 


Tyr 


Thr 


Tyr 


Pro 
95 


Val 


Ala 


Thr 


Gin 


Cys 


His 


Cys 


Gly 


Lys 


Cys 


Asp 


Ser 


Asp 


Ser 


Thr Asp 


Cys 








100 










105 










110 






Thr 


Val 


Arg 


Gly 


Leu 


Gly 


Pro 


Ser 


Tyr 


Cys 


Gly 


Ser 


Gly Ser 


Gly 


Ser 






115 










120 










125 








Gly 


Ser 
130 


Ala 


Pro 


Asp 


Val 


Gin 
135 


Asp 


Cys 


Pro 


Glu 


Cys 
140 


Thr 


Leu 


Gin 


Glu 


Asn 


Pro 


Phe 


Phe 


Ser 


Gin 


Pro 


Gly 


Ala 


Pro 


He 


Leu 


Gin 


Cys 


Met 


Gly 


145 










150 










155 










160 


Cys 


Cys 


Phe 


Ser 


Arg 
165 


Ala 


Tyr 


Pro 


Thr 


Pro 
170 


Leu 


Arg 


Ser 


Lys 


Lys 
175 


Thr 


Met 


Leu 


Val 


Gin 
180 


Lys 


Asn 


Val 


Thr 


Ser 
185 


Glu 


Ser 


Thr 


Cys 


Cys 
190 


Val 


Ala 


Lys 


Ser 


Tyr 
195 


Asn 


Arg 


Val 


Thr 


Val 
200 


Met 


Gly 


Gly 


Phe 


Lys 
205 


Val 


Glu 


Asn 


His 


Thr 
210 


Ala 


Cys 


His 


Cys 


Ser 
215 


Thr 


Cys 


Tyr 


Tyr 


His 
220 


Lys 


Ser 







(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 681 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

GGTACCTTAA GATTTGTGAT AATAACAAGT ACTGCAGTGG CACGCCGTGT GGTTCTCCAC 60 

TTTGAAACCC CCCATTACTG TGACCCTGTT ATATGATTTA GCTACACAGC AAGTGGACTC 120 

TGAGGTGACG TTCTTTTGGA CCAACATCGT CTTCTTGGAC CTTAGTGGAG TGGGATATGC 180 

TCTAGAGAAG CAGCAGCCCA TGCACTGAAG TATTGGGGCA CCCGGCTGGG AGAAGAATGG 24 0 

GTTTTCCTGT AGCGTGCATT CTGGGCAATC CTGCACATCA GGAGCGCTAC CAGATCCGCT 300 

ACCGGATCCG CAGTAGCTGG GCCCCAGGCC TCGCACAGTA CAATCAGTGC TGTCGCTGTC 360 

ACACTTGCCA CAGTGACACT GGGTGGCCAC TGGGTATGTA TACAAGGAAT CTGCATGGTG 420 

AGCACAGCCG GGCACTCTCA CTGTTTCATA TACCAGTTCC TTGAAGGTAC ATGTTTTCTG 4 80 

GATTTTGGGC CTGGCTGGGT CCTTATACAC. CAGATCCCTG GTGTAGCAGT AGCCAGCACA 540 

CCAAGTGGTG TTGATGCTTA TGCAGAAACG. ACATTCTTCT TTCTCTATTG CAATGGTGAT 600 

GTTGGTCAGC" TCACAGCTAT TGCAGCAGAT TGCTTTCCAG CAACAGAAAA GGAAGAAAAA 660 

CTGGAGTGTC TTCATCCTTG G 681 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...303 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

TGC GGA TCC GGT AGC GGA TCT GGT AGC GCT CCT GAT GTG CAG GAT TGC 4 8 

Cys Gly Ser Gly Ser Gly Ser Gly Ser Ala Pro Asp Val Gin Asp Cys 
15 10 15 

CCA GAA TGC ACG CTA CAG GAA AAC CCA TTC TTC TCC CAG CCG GGT GCC 96 
Pro Glu Cys Thr Leu Gin Glu Asn Pro Phe Phe Ser Gin Pro Gly Ala 
20 25 30 

CCA ATA CTT CAG TGC ATG GGC TGC TGC TTC TCT AGA GCA TAT CCC ACT 14 4 

Pro lie Leu Gin Cys Met Gly Cys Cys Phe Ser Arg Ala Tyr Pro Thr 
35 40 45 

CCA CTA AGG TCC AAG AAG ACG ATG TTG GTC CAA AAG CAA GTC ACC TCA 192 
Pro Leu Arg Ser Lys Lys Thr Met Leu Val Gin Lys Gin Val Thr Ser 
50 55 60 

GAG TCC ACT TGC TGT GTA GCT AAA TCA TAT AAC AGG GTC ACA GTA ATG 24 0 

Glu Ser Thr Cys Cys Val Ala Lys Ser Tyr Asn Arg Val Thr Val Met 
65 70 75 80 

GGG GGT TTC AAA^ GTG GAG CAA CAC ACG GCG TGC CAC TGC AGT ACT TGT 288 
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Gly Gly Phe Lys Val Glu Gin His Thr Ala Cys His Cys Ser Thr Cys 
85 90 95 

TAT TAT CAC AAA TCT TAAGGTACC 312 
Tyr Tyr His Lys Ser 
100 



(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 



Cys 


Gly 


Ser 


Gly 


Ser 


Gly 


Ser 


Gly 


Ser 


Ala 


Pro 


Asp 


Val 


Gin 


Asp Cys 


1 








5 










10 










15 


Pro 


Glu 


Cys 


Thr 
20 


Leu 


Gin 


Glu 


Asn 


Pro 
25 


Phe 


Phe 


Ser 


Gin 


Pro 
30 


Gly Ala 


Pro 


He 


Leu 
35 


Gin 


Cys 


Met 


Gly 


Cys 
40 


Cys 


Phe 


Ser 


Arg 


Ala 
45 


Tyr 


Pro Thr 


Pro 


Leu 
50 


Arg 


Ser 


Lys 


Lys 


Thr 
55 


Met 


Leu 


Val 


Gin 


Lys 
60 


Gin 


Val 


Thr Ser 


Glu 


Ser 


Thr 


Cys 


Cys 


Val 


Ala 


Lys 


Ser 


Tyr 


Asn 


Arg 


Val 


Thr 


Val Met 


65 










70 










75 








80 


Gly 


Gly 


Phe 


Lys 


Val 
85 


Glu 


Gin 


His 


Thr 


Ala 
90 


Cys 


His 


Cys 


Ser 


Thr Cys 
95 


Tyr 


Tyr 


His 


Lys 
100 


Ser 























(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

GTACCGGTAC CTTAAGATTT GTGATAATAA CAAGTACTGC AGTGGCACGC CGTGTGTTGC 60 

TCCACTTTGA AACCCCCCAT TACTGTGACC CTGTTATATG ATTTAGCTAC ACAGCAAGTG 120 

GACTCTGAGG TGACTTGCTT TTGGACCAAC ATCGTCTTCT TGGACCTTAG TGGAGTGGGA 180 

TATGCTCTAG AGAAGCAGCA GCCCATGCAC TGAAGTATTG GGGCACCCGG CTGGGAGAAG 240 

AATGGGTTTT CCTGTAGCGT GCATTCTGGG CAATCCTGCA CATCAGGAGC GCTACCAGAT 300 

CCGCTACCGG ATCCGCA 317 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 



29 



(A) LENGTH: 575 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 33... 575 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

ATGAAATCGA CGGAATCAGA CTCGAGCCAA GG ATG GAG ATG TTC CAG GGG CTG 53 

Met Glu Met Phe Gin Gly Leu 
1 5 

CTG CTG TTG CTG CTG CTG AGC ATG GGC GGG ACA TGG GCA TCC AAG GAG 101 
Leu Leu Leu Leu Leu Leu Ser Met Gly Gly Thr Trp Ala Ser Lys Glu 
10 15 20 

CCG CTT CGG CCA CGG TGC CGC CCC ATC CAA GCC ACC CTG GCT GTG GAG 14 9 

Pro Leu Arg Pro Arg Cys Arg Pro lie Gin Ala Thr Leu Ala Val Glu 
25 30 35 

AAG GAG GGC TGC CCC GTG TGC ATC ACC GTC AAC ACC ACC ATC TGT GCC 197 
Lys Glu Gly Cys Pro Val Cys lie Thr Val Asn Thr Thr lie Cys Ala 
40 45 50 55 

GGC TAC TGC CCC ACC ATG ACC CGC GTG CTG CAG GGG GTC CTG CCG GCC 245 
Gly Tyr Cys Pro Thr Met Thr Arg Val Leu Gin Gly Val Leu Pro Ala 
60 65 70 

CTG CCT CAG GTG GTG TGC AAC TAC CGC GAT GTG CGC TTC GAG TCC ATC 293 
Leu Pro Gin Val Val Cys Asn Tyr Arg Asp Val Arg Phe Glu Ser lie 
75 80 85 

CGG CTC CCT GGC TGC CCG CGC GGC GTG AAC CCC GTG GTC TCC TAC GCC 341 
Arg Leu Pro Gly Cys Pro Arg Gly Val Asn Pro Val Val Ser Tyr Ala 
90 95 100 

GTG GCT CTC AGC TGT CAA TGT GCA CTC TGC CGC CGC AGC ACC ACT GAC 38 9 

Val Ala Leu Ser Cys Gin Cys Ala Leu Cys Arg Arg Ser Thr Thr Asp 
105 110 115 

TGC GGG GGT CCC AAG GAC CAC CCC TTG ACC TGT GAT GAC CCC CGC TTC 437 
Cys Gly Gly Pro Lys Asp His Pro Leu Thr Cys Asp Asp Pro Arg Phe 
120 125 130 135 

CAG GAC TCC TCT TCC TCA AAG GCC CCT CCC CCC AGC CTT CCA AGC CCA 4 85 

Gin Asp Ser Ser Ser Ser Lys Ala Pro Pro Pro Ser Leu Pro Ser Pro 
140 145 150 

TCC CGA CTC CCG GGG CCC TCG GAC ACC CCG ATC CTC CCC CAA GGA TCC 533 
Ser Arg Leu Pro Gly Pro Ser Asp Thr Pro lie Leu Pro Gin Gly Ser 
155 160 165 
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GGT AGC GGA TCT GGT AGC GCT CCT GAT GTG CAG GAT TGC CCA 575 
Gly Ser Gly Ser Gly Ser Ala Pro Asp Val Gin Asp Cys Pro 
170 175 180 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 181 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 



Met 


Glu 


Met 


Phe 


Gin 


Gly 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Ser 


Met 


Gly 


1 








5 










10 










15 




Gly 


Thr 


Trp 


Ala 


Ser 


Lys 


Glu 


Pro 


Leu 


Arg 


Pro 


Arg 


Cys 


Arg 


Pro 


He 








20 










25 










30 






Gin 


Ala 


Thr 


Leu 


Ala 


Val 


Glu 


Lys 


Glu 


Gly 


Cys 


Pro 


Val 


Cys 


He 


Thr 






35 










40 










45 








Val 


Asn 


Thr 


Thr 


He 


Cys 


Ala 


Gly 


Tyr 


Cys 


Pro 


Thr 


Met 


Thr 


Arg 


Val 




50 










55 










60 










Leu 


Gin 


Gly 


Val 


Leu 


Pro 


Ala 


Leu 


Pro 


Gin 


Val 


Val 


Cys 


Asn 


Tyr 


Arg 


65 










70 










75 










80 


Asp 


Val 


Arg 


Phe 


Glu 


Ser 


He 


Arg 


Leu 


Pro 


Gly 


Cys 


Pro 


Arg 


Gly Val 










85 










90 










95 




Asn 


Pro 


Val 


Val 


Ser 


Tyr 


Ala 


Val 


Ala 


Leu 


Ser 


Cys 


Gin 


Cys 


Ala 


Leu 








100 










105 










110 






Cys 


Arg 


Arg 


Ser 


Thr 


Thr 


Asp 


Cys 


Gly 


Gly 


Pro 


Lys 


Asp 


His 


Pro 


Leu 






115 










120 










125 








Thr 


Cys 


Asp 


Asp 


Pro 


Arg 


Phe 


Gin 


Asp 


Ser 


Ser 


Ser 


Ser 


Lys 


Ala 


Pro 




130 










135 










140 










Pro 


Pro 


Ser 


Leu 


Pro 


Ser 


Pro 


Ser 


Arg 


Leu 


Pro 


Gly 


Pro 


Ser 


Asp 


Thr 


145 










150 










155 










160 


Pro 


He 


Leu 


Pro 


Gin 


Gly 


Ser 


Gly 


Ser 


Gly 


Ser 


Gly 


Ser 


Ala 


Pro 


Asp 










165 










170 










175 




Val 


Gin 


Asp 


Cys 


Pro 

























180 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 549 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

TGGGCAATCC TGC AC AT CAG GAGCGCTACC AGATCCGCTA CCGGATCCTT GGGGGAGGAT 60 

CGGGGTGTCC GAGGGCCCCG GGAGTCGGGA TGGGCTTGGA AGGCTGGGGG GAGGGGCCTT 120 

TGAGGAAGAG GAGTCCTGGA AGCGGGGGTC ATCACAGGTC AAGGGGTGGT CCTTGGGACC 180 





CCCGCAGTCA GTGGTGCTGC GGCGGCAGAG TGCACATTGA CAGCTGAGAG CCACGGCGTA 24 0 

GGAGACCACG GGGTTCACGC CGCGCGGGCA GCCAGGGAGC CGGATGGACT CGAAGCGCAC 300 

ATCGCGGTAG TTGCACACCA CCTGAGGCAG GGCCGGCAGG ACCCCCTGCA GCACGCGGGT 360 

CATGGTGGGG CAGTAGCCGG CACAGATGGT GGTGTTGACG GTGATGCACA CGGGGCAGCC 420 

CTCCTTCTCC ACAGCCAGGG TGGCTTGGAT GGGGCGGCAC CGTGGCCGAA GCGGCTCCTT 4 80 

GGATGCCCAT GTCCCGCCCA TGCTCAGCAG CAGCAACAGC AGCAGCCCCT GGAACATCTC 54 0 

CATCCTTGG 54 9 



(2) INFORMATION FOR SEQ ID NO: 38: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 837 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ix) FEATURE: 



(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 33. . .827 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 



ATGAAATCGA CGGAATCAGA CTCGAGCCAA GG ATG GAG ATG TTC CAG GGG CTG 53 

Met Glu Met Phe Gin Gly Leu 
1 5 

CTG CTG TTG CTG CTG CTG AGC ATG GGC GGG ACA TGG GCA TCC AAG GAG 101 

Leu Leu Leu Leu Leu Leu Ser Met Gly Gly Thr Trp Ala Ser Lys Glu 

10 15 20 

CCG CTT CGG CCA CGG TGC CGC CCC ATC AAT GCC ACC CTG GCT GTG GAG 14 9 

Pro Leu Arg Pro Arg Cys Arg Pro lie Asn Ala Thr Leu Ala Val Glu 
25 30 35 

AAG GAG GGC TGC CCC GTG TGC ATC ACC GTC AAC ACC ACC ATC TGT GCC 197 
Lys Glu Gly Cys Pro Val Cys lie Thr Val Asn Thr Thr lie Cys Ala 
40 45 50 55 

GGC TAC TGC CCC ACC ATG ACC CGC GTG CTG CAG GGG GTC CTG CCG GCC 24 5 

Gly Tyr Cys Pro Thr Met Thr Arg Val Leu Gin Gly Val Leu Pro Ala 
60 65 70 

CTG CCT CAG GTG GTG TGC AAC TAC CGC GAT GTG CGC TTC GAG TCC ATC 293 
Leu Pro Gin Val Val Cys Asn Tyr Arg Asp Val Arg Phe Glu Ser lie 
75 80 85 

CGG CTC CCT GGC TGC CCG CGC GGC GTG AAC CCC GTG GTC TCC TAC GCC 341 
Arg Leu Pro Gly Cys Pro Arg Gly Val Asn Pro Val Val Ser Tyr Ala 
90 95 100 

GTG GCT CTC AGC TGT CAA TGT GCA CTC TGC CGC CGC AGC ACC ACT GAC 389 
Val Ala Leu Ser Cys Gin Cys Ala Leu Cys Arg Arg Ser Thr Thr Asp 
105 110 115 

TGC GGG GGT CCC AAG GAC CAC CCC TTG ACC TGT GAT GAC CCC CGC TTC 4 37 
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Cys Gly Gly Pro Lys Asp His Pro Leu Thr Cys Asp Asp Pro Arg Phe 
120 125 130 135 

CAG GAC TCC TCT TCC TCA AAG GCC CCT CCC CCC AGC CTT CCA AGC CCA 4 85 

Gin Asp Ser Ser Ser Ser Lys Ala Pro Pro Pro Ser Leu Pro Ser Pro 
140 145 150 

TCC CGA CTC CCG GGG CCC TCG GAC ACC CCG ATC CTC CCC CAA GGA TCC 533 
Ser Arg Leu Pro Gly Pro Ser Asp Thr Pro lie Leu Pro Gin Gly Ser 
155 160 165 

GGT AGC GGA TCT GGT AGC GCT CCT GAT GTG CAG GAT TGC CCA GAA TGC 581 
Gly Ser Gly Ser Gly Ser Ala Pro Asp Val Gin Asp Cys Pro Glu Cys 
170 175 180 

ACG CTA CAG GAA AAC CCA TTC TTC TCC CAG CCG GGT GCC CCA ATA CTT 62 9 

Thr Leu Gin Glu Asn Pro Phe Phe Ser Gin Pro Gly Ala Pro lie Leu 
185 190 195 

CAG TGC ATG GGC TGC TGC TTC TCT AGA GCA TAT CCC ACT CCA CTA AGG 677 
Gin Cys Met Gly Cys Cys Phe Ser Arg Ala Tyr Pro Thr Pro Leu Arg 
200 205 210 215 

TCC AAG AAG ACG ATG TTG GTC CAA AAG CAA GTC ACC TCA GAG TCC ACT 725 
Ser Lys Lys Thr Met Leu Val Gin Lys Gin Val Thr Ser Glu Ser Thr 
220 225 230 

TGC TGT GTA GCT AAA TCA TAT AAC AGG GTC ACA GTA ATG GGG GGT TTC 7 73 

Cys Cys Val Ala Lys Ser Tyr Asn Arg Val Thr Val Met Gly Gly Phe 
235 240 245 

AAA GTG GAG CAA CAC ACG GCG TGC CAC TGC AGT ACT TGT TAT TAT CAC 821 
Lys Val Glu Gin His Thr Ala Cys His Cys Ser Thr Cys Tyr Tyr His 
250 255 260 

AAA TCT TAAGTTAACC 837 
Lys Ser 
265 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Met Glu Met Phe Gin Gly Leu Leu Leu Leu Leu Leu Leu Ser Met Gly 

15 10 15 

Gly Thr Trp Ala Ser Lys Glu Pro Leu Arg Pro Arg Cys Arg Pro lie 
20 25 30 
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(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

TCCGGATTAG CTTGAGATGG ATCCGGTTAA CTTAAGATTT GTGATAATAA CAAGTACTGC 60 

AGTGGCACGC CGTGTGTTGC TCCACTTTGA AACCCCCCAT TACTGTGACC CTGTTATATG 12 0 

ATTTAGCTAC ACAGCAAGTG GACTCTGAGG TGACTTGCTT TTGGACCAAC ATCGTCTTCT 180 

TGGACCTTAG TGGAGTGGGA TATGCTCTAG AGAAGCAGCA GCCCATGCAC TGAAGTATTG 24 0 

GGGCACCCGG CTGGGAGAAG AATGGGTTTT CCTGTAGCGT GCATTCTGGG CAATCCTGCA 300 

CAT C AG GAG C GCTACCAGAT CCGCTACCGG ATCCTTGGGG GAGGATCGGG GTGTCCGAGG 360 

GCCCCGGGAG TCGGGATGGG CTTGGAAGGC TGGGGGGAGG GGCCTTTGAG GAAGAGGAGT 420 

CCTGGAAGCG GGGGTCATCA CAGGTCAAGG GGTGGTCCTT GGGACCCCCG CAGTCAGTGG 48 0 

TGCTGCGGCG GCAGAGTGCA CATTGACAGC TGAGAGCCAC GGCGTAGGAG ACCACGGGGT 54 0 

TCACGCCGCG CGGGCAGCCA GGGAGCCGGA TGGACTCGAA GCGCACATCG CGGTAGTTGC 600 

ACACCACCTG AGGCAGGGCC GGCAGGACCC CCTGCAGCAC GCGGGTCATG GTGGGGCAGT 660 

AGCCGGCACA GATGGTGGTG TTGACGGTGA TGCACACGGG GCAGCCCTCC TTCTCCACAG 720 

CCAGGGTGGC ATTGATGGGG CGGCACCGTG GCCGAAGCGG CTCCTTGGAT GCCCATGTCC 780 

CGCCCATGCT CAGCAGCAGC AACAGCAGCA GCCCCTGGAA CATCTCCATC CTTGG 835 




(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 
GGAGGAAGGG TGGTCGACCT CTCTGGT 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 
CACATCAGGA GCTTGTGGGA GGATCGG 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 
ATCCTCCCAC AAGCTCCTGA TGTGCAG 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 
TGAGTCGACA TGATAATTCA GTGATTGAAT 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 



ATGAAATCGA CGGAATCAGA CTCGAGCCAA G GAT G GAG AT GTTCCAGGGG CTGCT 



55 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
GCTACCAGAT CCGCTACCGG ATCCTTGGGG GAGGAT.CGGG GTGTCCGAGG G 51 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
GGATCCGGTA GCGGATCTGG TAGCGCTCCT GATGTGCAGG ATTGCCCA 48 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
TCCGGATTAG CTTGAGATGG ATCCGGTACC TTAAGATTTG TGATAATAAC AAGTACTGCA 60 
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
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ATGAAATCGA CGGAATCAGA CTCGAGCCAA GG 
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(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
TCCGGATTAG CTTGAGATGG ATCCGGTACC TTA 33 
(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:51: 

Met Glu Met Phe Gin Gly Leu Leu Leu Leu Leu Leu Leu Ser Met Gly 

15 10 15 

Gly Thr Trp Ala 
20 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 

Gly Ser Gly Ser Gly Ser Gly Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
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GCTACCGGAT CCCCGCGGGT CATCACAGGT CAAGGGGTGG T 



41 



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
ATGAAATCGA CGGAATCAGA CTCGAGCCAA GGAATGGAGA TGCTCCAGGG GCTGCT 56 
(2) INFORMATION FOR SEQ ID NO:55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
GCTACCAGAT CCGCTACCGG ATCCTTGGGG GTGGTCACAG GTCAAGGGGT G 51 
(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Met Glu Met Leu Gin Gly Leu Leu Leu Leu Leu Leu Leu Ser Met Gly 

15 10 15 

Gly Ala Trp Ala 
20 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
ATGAAATCGA CGGAATCAGA CTCGAGCCAA GGATGAAGAC ACTCCAGTTT TTCTTCC 57 
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(2) INFORMATION FOR SEQ ID NO: 58: 

(i). SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
ACCAGATCCG CTACCGGATC CTTCTTTCAT TTCACCAAAG GAGCAG 4 6 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Met Lys Thr Leu Gin Phe Phe Phe Leu Phe Cys Cys Trp Lys Ala lie 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: ' 

GCTACCGGAT CCTTCTTTCA TTTCACCAAA G GAGCAG TAG CTGGGCCCCA GGCCTCGCAC 60 
AGTACAATCA GTGCTGTCGC TGTCGCAGAG TGCACATTGA CAGCTGACAG C 111 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

GCTACCGGAT CCTTCTTTCA TTTCACCAAA GGAGCAGTAG CTGGGCCCCA GGCCTCGCAC 60 
AGTGCAGTCA GTGGTGCTGC GGCGGCA 87 



Cys Cys 




t 



(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

GCTACCGGAT CCTTCACCAA AGGAGCAGTA GCTGGGCCCC AGGCCTCGCA CAGTGCAGTC 60 
AGTGGTGCTG CGGCGGCA 7 8 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

GCTACCGGAT CCCCGCGGGT CATCGCAGTA GCTGGGCCCC AGGCCTCGCA CAGTGCAGTC 60 
AGTGGTGCTG CGGCGGCA 78 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

TGCTTCTCTA GAGCATATCC CACTCCACTA AGGTCCAAGA AGACGATGTT GGTCCAAAAG 60 
CAAGTCACCT 70 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

GTACCGGTAC CTTAAGATTT GTGATAATAA CAAGTACTGC AGTGGCACGC CGTGTGTTGC 60 
TCCACTTTGA AAC 73 



(2) INFORMATION FOR SEQ ID NO: 66: 




• i 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
CGGGGTAGGT TCGGTGGGAC CGACACCTCT TCCTCCCGAC GGGG 4 4 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
CTACCACCAC AACTGCCACT ACGTGTGCCC CGTCGGGAGG AAGAGGTG 4 8 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
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(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
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(2) INFORMATION FOR SEQ ID NO:70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:70: 
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(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
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70 










75 










80 


Ser 


Phe 


Pro 


Val 


Ala 
85 


Leu 


Ser 


Cys 


Arg 


Cys 
90 


Gly 


Pro 


Cys 


Arg 


Arg 
95 


Ser 


Thr 


Ser 


Asp 


Cys 
100 


Gly 


Gly 


Pro 


Lys 


Asp 
105 


His 


Pro 


Leu 


Thr 


Cys 
110 


Asp 


His 



Pro Gin 



(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 



Asn 


Ser 


Cys 


Glu 


Leu 


Thr 


Asn 


He 


Thr 


lie 


Ala 


Val 


Glu 


Lys 


Glu 


Gly 


1 








5 










10 










15 




Cys 


Gly 


Phe 


Cys 
20 


He 


Thr 


He 


Asn 


Thr 
25 


Thr 


Trp 


Cys 


Ala 


Gly 
30 


Tyr 


Cys 


Tyr 


Thr 


Arg 
35 


Asp 


Leu 


Val 


Tyr 


Lys 
40 


Asp 


Pro 


Ala 


Arg 


Pro 
45 


Lys 


lie 


Gin 


Lys 


Thr 
50 


Cys 


Thr 


Phe 


Lys 


Glu 
55 


Leu 


Val 


Tyr 


Glu 


Thr 
60 


Val 


Arg 


Val 


Pro 


Gly. 


Cys 


Ala 


His 


His 


Ala 


Asp 


Ser 


Leu 


Tyr 


Thr 


Tyr 


Pro 


Val 


Ala 


Thr 


65 










70 










75 










80 


Gin 


Cys 


His 


Cys 


Gly 
85 


Lys 


Cys 


Asp 


Ser 


Asp 
90 


Ser 


Thr 


Asp 


Cys 


Thr 
95 


Val 


Arg 


Gly 


Leu 


Gly 
100 


Pro 


Ser 


Tyr 


Cys 


Ser 
105 


Phe 


Gly 


Glu 


Met 


Lys 
110 


Glu 





(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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IF 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 



Asn 








T,pn 


Thr 

X Hi 


no i i 


lie 

X X c 


Thr 

X 1 IX, 


lie 

X X C 


Ala 


Val 


Glu 


Lys 


Glu 


Glv 

vjx y 


1 


















1 0 

X w 










X sj 




Cys 


Gly 


Phe 


Cys 
20 


He 


Thr 


lie 


Asn 


Thr 
25 


Thr 


Trp 


Cys 


Ala 


Gly 
30 


Tyr 


Cys 


Tyr 


Thr 


Arg 


Asp 


Leu 


Val 


Tyr 


Lys 
a n 


Asp 


Pro 


Ala 


Arg 


Pro 

A S 


Lys 


He 


Gin 


Lys 


Thr 
50 


Cys 


Thr 


Phe 


Lys 


Glu 
55 


Leu 


Val 


Tyr 


Glu 


Thr 
60 


Val 


Arg 


Val 


Pro 


Gly 


Cys 


Ala 


His 


His 


Ala 


Asp 


Ser 


Leu 


Tyr 


Thr 


Tyr 


Pro 


Val 


Ala 


Thr 


65 










70 










75 










80 


Gin 


Cys 


His 


Cys 


Gly 
85 


Lys 


Cys 


Asp 


Ser 


Asp 
90 


Ser 


Thr 


Asp 


Cys 


Thr 
95 


Val 


Arg 


Gly 


Leu 


Gly 
100 


Pro 


Ser 


Tyr 


Cys 


Ser 
105 


Phe 


Gly 


Glu 











(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 



Asn 


Ser 


Cys 


Glu 


Leu 


Thr 


Asn 


He 


Thr 


He 


Ala 


Val 


Glu 


Lys 


Glu 


Gly 


1 








5 










10 










15 




Cys 


Gly 


Phe 


Cys 
20 


He 


Thr 


He 


Asn 


Thr 
25 


Thr 


Trp 


Cys 


Ala 


Gly 
30 


Tyr 


Cys 


Tyr 


Thr 


Arg 
35 


Asp 


Leu 


Val 


Tyr 


Lys 
40 


Asp 


Pro 


Ala 


Arg 


Pro 
45 


Lys 


He 


Gin 


Lys 


Thr 
50 


Cys 


Thr 


Phe 


Lys 


Glu 
55 


Leu 


Val 


Tyr 


Glu 


Thr 
60 


Val 


Arg 


Val 


Pro 


Gly 


Cys 


Ala 


His 


His 


Ala 


Asp 


Ser 


Leu 


Tyr 


Thr 


Tyr 


Pro 


Val 


Ala 


Thr 


65 










70 










75 










80 


Gin 


Cys 


His 


Cys 


Gly 
85 


Lys 


Cys 


Asp 


Ser 


Asp 
90 


Ser 


Thr 


Asp 


Cys 


Thr 
95 


Val 


Arg 


Gly 


Leu 


Gly 
100 


Pro 


Ser 


Tyr 


Cys 



















(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

Asp Ser Asp Ser Thr Asp Cys Thr Val Arg Gly Leu Gly Pro Ser Tyr 
15 10 15 
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Cys Ser Phe Gly Glu Met Lys Glu 
20 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Thr Val Arg Gly Leu Gly Pro Ser Tyr Cys Ser Phe Gly Glu Met Lys 
15 10 15 



(2) INFORMATION FOR SEQ ID NO:77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

Thr Val Arg Gly Leu Gly Pro Ser Tyr Cys Ser Phe Gly Glu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

Thr Val Arg Gly Leu Gly Pro Ser Tyr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 



Glu 



45 







nu .rloLj v aJL 








n X KJ 






Th r 

1 I IX 


Toil 


OX 11 


ox u. 




Prn 

i- X C 


1 
X 




c 










1 0 










X 




Phe 


rue ocx uiii 


Prn 




AI a 


n x \j 


lie 


T .on 


om 










^y o 






























Phe 


Ser Arg Ala 


Tvr 


Pro 


Thr 


Pro 


Leu 


Arg 


Ser 


Lys 


Lys 


Thr 


Met 


Leu 




35 








40 










45 








Val 


Gin Lys Asn 


Val 


Thr 


Ser 


Glu 


Ser 


Thr 


Cys 


Cys 


Val 


Ala 


Lys 


Ser 




50 






55 










60 










Tyr 


Asn Arg Val 


Thr 


Val 


Met 


Gly 


Gly 


Phe 


Lys 


Val 


Glu 


Asn 


His 


Thr 


65 






70 










75 










80 


Ala 


Cys His Cys 


Ser 


Thr 


Cys 


Tyr 


Tyr 


His 


Lys 


Ser 











85 90 



(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

Gly Ser Gly Ser 
1 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Gly Ser Gly Ser Gly Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser 
15 10 



(2) INFORMATION FOR SEQ ID NO: 83: 
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# 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 



Asp Asp Pro Arg 
1 



